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Abstract 


An  implementation  of  a  compiler  for  the  SUF  System 
Language  to  produce  code  for  the  PDP-11  family  of  computers 
is  presented.  The  need  for  a  high  level  system  language  for 
minicomputers  is  discussed,  along  with  the  reasons  for 
believing  that  the  STT^  System  Language  is  suitable  as  an 
implementation  language  for  minicomputers. 

The  PDP-11  coniDiler  for  the  System  Language  vas 
generated  by  modifying  the  existing  System/360  compiler  for 
the  SUP  System  Language.  The  similarities  and  the 
differences  between  the  two  compilers  are  described. 

A  discussion  of  the  efficiency  of’  the  object  code 
produced  by  the  PDP-11  compiler  is  included,  along  with  some 
suggestions  on  alternative  code  sequences  which  might  have 
been  used.  In  conclusion,  the  possibility  of  usina  the  SU17 
System  Language  as  an  implementation  language  for  other 
minicomputers  is  discussed. 
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1  Introduction 


Even  though  the  value  of  a  high  level  system  language 
is  generally  recognized,  high  level  system  languages  for 
minicomputers  have  not  been  available  until  recently.  We 
feel  that  a  system  language  for  minicomputers  should  be 
sufficiently  machine  independent  and  powerful  so  as  to  be 
used  on  a  reasonably  large  number  of  minicomputers.  In  our 
opinion,  the  SUE  System  Language  meets  the  above  criteria. 
Accordingly,  a  SUE  System  language  compiler  for  the  pup-11 
family  of  computers  has  been  develoned. 

In  the  following  chapters,  we  will  describe  the  SUE 
System  Language,  our  reasons  for  believing  that  it  is 
suitable  as  a  system  language  for  minicomputers,  as  well  as 
our  implementation  of  a  compiler  for  the  language  to  produce 
code  for  the  PDP-11.  We  hope  to  convince  the  reader  of  the 
appropriateness  of  our  choice  of  the  SITE  System  Language, 
and  to  give  some  indication  of  the  success  of  the  compiler 
in  producing  software  for  the  PDP-11.  In  addition, 
throughout  the  thesis,  we  will  discuss  the  feasibility  of 
implementing  compilers  for  the  SUE  System  Language,  for 
minicomputers  other  than  the  PDP-11. 


—  f>— 
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1  minicomputers 


Although  minicomputers  have  been  available  for  many 
years,  the  full  range  of  their  applicability  to  all  aspects 
of  computing  is  just  recently  being  realize!. 
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Several  functional  characteristics  of  mini 
also  in  the  mini  category.  The  most  important 
terms  of  throughput,  is  the  word  length  of  the 
word  length  most  common  to  recent  machines 
although  minicomputers  are  also  available  with 
8,  12,  and  10  bits. 


comp 

u  te 

rs 

are 

of  t 

hes 

e. 

in 

mach 

ine 

• 

^he 

is 

16 

bi 

ts. 

word 

si 

zes 

of 

Traditionally,  the  memory  sizes  of  minicomputers  have 
been  relatively  small.  Basic  minicomputer  systems  usually 
have  4096  or  8092  words  of  memory,  although  in  general, 
minicomputers  can  be  expanded  into  much  larger  memory  sizes. 

In  the  past,  the  instruction  sets  of  minicomputers  have 
tended  to  be  relatively  unsophisticated.  This  was  brought 
about  partly  by  the  small  word  size  of  the  machine,  and 
partly  in  an  effort  to  keep  the  cost  of  minicomputers  low 
(PDP-8).  Recent  advances  in  .solid  state  circuit 
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techn olog ies  have,  however,  allowed  the  instruction  sets  of 
newer  minicomputers  bo  become  quite  sophisticated.  Hardware 
multiply,  divide,  and  floating  point  instructions  are 
typically  not  included  in  the  instruction  repertoire  of 
minicomputers,  in  an  effort  to  keep  the  cost  of  the  basic 
machine  down.  One  can  obtain  multiply,  divide,  and  floating 
point  units  for  an  extra  cost. 


rven 

though 

the  internal  speeds  of  minicomputers  are 

compa  rable 

to 
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larger  machines. 

they  are 

n  ever thele 

ss 

much 

slowe  r 

(less  throughput) 

than  larger 

processors,  mainly  because  of  short  wordlengths. 
Consequently,  performing  some  benchmark  ^ask  on  a  small 
machine  requires  more  memory  cycles  than  the  same  task  would 
require  on  a  larger  (longer  wordlength)  machine. 

Because  of  their  low  cost,  minicomputers  have  been  used 
for  process  control,  message  switching,  or  to  gjive  greater 
flexibility  to  a  larger  machine  by  providing  remote  job 
entry,  or  time  sharina. 

Minicomputers  may  also  be  an  aid  in  the  teaching  of 
computer  science.  Machine  organization  can  probably  be 
taught  easier  on  a  small  machine  on  account  of  the 
simplicity  of  the  hardware.  Assembler  language  programming 
could  also  be  taught  on  a  minicomputer  at  a  lower  cost. 
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Another  aspect 
greatly  from  t:he  use 
operating  system 
principles  involved  i 
may  be  discussed  i 
these  ideas  to  a  part 
under  stand ing . 


of  computer  science  which  can  benefit 
of  minicomputers,  is  the  teaching  of 
principles.  The  structure,  and  the 
n  the  design  of  an  operating  system, 
n  a  classroom,  but  some  application  of 
icular  machine  is  essential  for  full 
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Most  large  computer  centres  are  unwilling,  or  perhaps 
e,  to  allow  programs  which  are  inherently  dangerous  to 
fficient  operation  of  their  system,  to  operate  on  their 
ne.  A  minicomputer,  however,  would  allow  a  student  to 
te  in  a  hands-on  environment,  possibly  for  the  first 
More  imDortant  perhaps,  by  allowing  exclusive  use  of 
machine  to  each  student  in  turn,  errors  in  program 
n  will  only  effect  the  particular  user,  and  not  all 
of  a  large  system. 


1 . 2  The  Peed  for  High  Level  languages  for  Minicomputers 


Unfortunately,  the  evolvement  in  the  miniaturization  of 
hardware  which  has  brought  about  the  recent  advancements  in 
minicomputer  design,  has  not  been  paralleled  by  the 
development  of  software.  Systems  software  for  minicomputers 
is  generally  written  in  the  assembler  language  of  the  object 
machine.  We  felt  that  some  alternative  means  of  software 
development  for  minicomputers  was  necessary. 
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Sammet  [ Sam met  1971]  has  clearly  indicated  the 
advantages  of  a  high  level  language  for  software 
implementation.  "Briefly  they  are: 

1)  Good  potential  for  conversion  to  another  machine. 

2)  Save  calendar  and/or  manpower  time. 

3)  Fasier  to  maintain  and  document. 

a)  Fasier  to  maintain  project  because  of  greater 

flexibility  in  having  a  person  pick  up  someone  else’s 
work . 


5)  Greater  ability  to  understand  the  overall  system, 
resulting  in  better  performance  through  the  ease  of 
fundamental  redesign. 

These  advantages  are  applicable  not  only  to  software 
written  for  large  machines;  however,  except  for  a  few 
exceptions,  high  level  system  languages  for  minicomputers 
are  at  the  present  not  available. 


The  reasons  behind  the  use  of  low  level  languages  for 
minicomputer  software  development  were  the  following. 
First,  on  account  of  their  low  cost,  minicomputers  have 
traditionally  been  applied  to  small,  dedicated  tasks.  As  a 
result,  it  was  felt  that  the  software  would  only  have  to  be 
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written  once,  and  the  best  possible  code  sequence  should 
have  been  used.  Secondly,  early  minicomputers  had  such 
unsophisticated  instruction  sets,  that  it  was  felt  that 
compilers  for  high  level  languages  could  not  produce  as  good 
code  as  a  programmer  familiar  with  the  idiosynchrasies  of 
the  machine.  Thirdly,  since  minicomputers  were  often  used 
for  real  time  applications,  it  was  felt  that  the  code 
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compiler 
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lang 

uage  could 

not 
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good 
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language 
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not  m 
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size  restraints  of  minicomputers. 


The  new  generation  of  minicomputers,  however,  provide 
the  capability  of  using  a  high  level  language  for  software 
development.  The  instruction  sets  of  some  .minicomputers 
have  evolved  to  the  point  of  being  as  elegant,  if  not  more 
so  than  some  larger  machines.  Minicomputers  have  become 
versatile  enough  to  handle  problems  large  enough,  so  that 
the  under standability  of  programs  becomes  an  important 
consideration.  To  achieve  low  cost,  the  memory  sizes  of 
minicomputers  are  still  small  in  comparison  to  those 
available  on  larger  machines,  and  perhaps  too  small  to 
support  a  compiler  for  a  good  high  level  language.  It  is, 
however,  becoming  standard  practice  to  compile  programs  to 
be  run  on  a  minicomputer,  on  a  large  machine,  and  to  then 
transfer  the  object  code  produced  onto  the  minicomputer  for 
execution  r Waks  1972].  The  actual  minicomputers  are 
configured  to  run  programs,  not  to  compile  them. 
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1 . 3  Fxisting  High  Level  Languages  for  minicomputers 

At  the  present  time,  most  minicomputers  have  at  least 
one  compiler  for  a  high  level  language-  Frequently,  either 
BASIC  and/or  FORTRAN  has  been  chosen  as  that  language,  on 
account  of  their  widespread  use  and  ease  of  implementation 
of  a  specific  subset  of  the  language.  rven  though  systems 
software  has  been  written  in  FORTRAN  by  the  addition  to  the 
language  of  hardware  dependent  features  [Dubois  1971],  it  is 
not  suitable  for  systems  implementation. 

The  Burroughs  family  of  computers  are  designed  with  the 
ALGOL  lanquaqe  in  mind,  and  systems  software  is  implemented 
in  an  extended  version  of  ALGOL.  A  great  deal  of  time  and 
manpower  would  be  needed  to  design  and  implement  extensions 
to  the  ALGOL  languaoe  required  to  meet  the  demands  of  each 
particular  machine. 

The  PUSS  system  implementation  language  [  Fulf  197  11, 


written  for  the 

PPP-10,  has 

been  in  active  use 

for 

seve  ral 

years.  During 

that  time. 

a  fairly  large  numb 

er  of 

systems 

have  been  written  in  BLISS, 

including  several 

compil 

ers ,  a 

conversational 

programmina 

lang uage (API) ,  an 

I/O 

support 

system,  and  many  other  applications.  The  BLISS  language 
provides  very  elegant  control  structures,  and  excludes  the 
goto  from  the  language.  The  data  structuring  capabilities  of 
the  language  are  both  general,  and  powerful,  an  essential 
requirement  for  a  systems  implementation  language. 
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£  version  of  the  BLISS  language  designed  to  produce 
software  for  the  PDP-11  has  lust  become  available.  Several 
objections  come  immediately  to  mind.  The  BLISS  language  was 
designed  with  the  PDP-10  in  mind,  and  the  resulting  machine 
dependency  is  present  in  a  major  part  of  the  language. 
BLISS  is  a  typeless  language,  since  the  PDP-10  is  a  typeless 
machine,  and  any  field  up  to  36  bits  in  length  may  be 
accessed  directly.  Since  the  PDP-11  allows  addressablit y 
only  to  8  bit  bytes,  and  16  bit  words,  the  addressing 
canabilities  of  BLISS  for  the  PDP-11  implementation  had  to 
be  restricted  to  a  byte  or  a  word.  Since  the  data 
structuring  capabilities  of  the  BLISS  language  are  built 
around  the  ability  to  address  variable  length  fields,  the 
above  restriction  seems  a  serious  change  to  the  original 
BLISS  language  definition. 

A  more  immediate  problem  is  the  availability  of  the 
PDP-10.  Since  the  BLISS-11  compiler  is  written  in  BLISS-10, 
a  PDP-10  of  sufficient  si.ze(60K)  must  be  available  to 
compile  programs  for  the  PDP-11. 
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1 . 4  The  SUE  System  Language 


Project  SUE  [Atwood  et  al  1972]  is  a  project 

established  at  the  Computer  Systems  Research  Group, 
University  of  Toronto.  Its  goal  is  the  design  and 

implementation  of  an  extendable,  efficient,  and  reliable 
time  sharing  operating  system  for  the  System/360/370  family 
of  computers.  It  was  felt  that  the  implementation  of  system 
components  would  be  unmanagable  in  other  than  a  high  level 
language.  Accordingly,  a  language  and  its  compiler  [Clark 
1971]  have  been  developed. 


As  stated  by  Clark,  the  requirements  for  the  system 
language  were  the  following: 

1)  The  System  Language  must  facilitate  nicely  structured 
programs  and  data.  It  must  be  convenient  to  write 
compact  programs  whose  control  structure  is  readily 
apparent  and  amenable  to  demonstrations  of  correctness. 

2)  The  System  Language  must  be  readable. 


3)  The  System  Language  must  actively  assist  in  the 
detection  and  isolation  of  bugs  and  logical  errors. 


4) 


The  System  Language 
System/360  code.  This 
dependent  as  it  might 


must  be  compilable  into  efficient 
requirement  is  not  as  machine 
seem  to  be  at  first  glance,  since 
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it  involved  the  exclusion  of 


"oowerful”  constructs 

L. 


which  inherently  produce  bad  code. 


5)  In  order  to  meet  the  requirements  of  software  written 
in  a  systems  language,  facility  need  be  provided  for 
addressing  absolute  machine  locations,  and  for  emitting 
specific  instructions  under  programmer  control. 


At 

designed , 
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1.4.1  Language  Description 

The  following  paragraphs  comprise  a  summary  of  the  SU^ 
System  Language  with  respect  to  its  data  structuring 
capabilities,  control  structures,  and  overall  program 
organization.  For  a  complete  description,  see  [Clark  1972j. 


1.4. 1 . 1  Compilation  Structures 


During  the  design  of  the  SUE  System  Language,  it  was 
felt  that  a  correct  system  of  programs  could  be  produced 
more  efficiently  if  changes  to  a  module  would  not  require 
the  recompilation  of  uneffected  modules.  Consequently,  the 


System  Language  is  designed  to  facilitate  the  separate 
compilation  of  hierarchically  related  programs.  A  system  of 
programs  consists  of  one  or  more  compilation  blocks.  Three 
types  of  compilation  blocks  are  available:  context  blocks, 
data  blocks,  and  program  blocks. 


A  system  of  programs  in  the  SUE  System  Language  will 
consist  of  one  or  more  procedures.  Procedures  are  declared 
in  their  enclosing  scopes.  A  procedure  declaration  contains 
only  the  information  needed  to  compile  calls  on  that 
procedure.  Each  procedure  has  associated  with  it,  both  a 
data  block  and  a  program  block.  A  data  block  contains  the 
names  of  the  parameters  and  return  values  of  the  procedure 
with  which  it  is  associated,  as  well  as  declarations  for 
local  procedures,  local  variables  which  are  to  be  shared 
with  these  procedures,  type  declarations,  and  macro 
definitions.  A  ££2SL3.HL  block  contains  the  strictly  local 
declarations  of  the  procedure,  as  well  as  its  executable 
code.  A  context  block  is  similar  to  a  data  block,  but  is 
restricted  to  constant  declarations  (e.g.  types)  which  are 
to  be  shared  by  the  procedures  of  cooperating  processes. 
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1.4. 1.2  Data  Structures 


The  data  structures  of  the  SUE  System  Language  most 
closely  resemble  those  of  the  language  PASCAL  [wirth  1971], 
The  user  may  define  his  own  data  types  by  the  symbolic  names 
of  all  the  constants  belonging  to  that  type,  and  thereafter 
the  mapping  of  the  data  type  onto  machine  locations  is  the 
responsibility  of  the  compiler.  For  example: 

t_ype  Colour  =  (ped ,  Blue,  Green)  ; 

Structured  user  defined  data  tvoes  are  also  available 

j.  L 

in  the  language.  These  include  arrays,  records,  groups, 
pointers,  and  powersets. 

The  introduction  of  user  defined  data  types  result  in 
two  advantages.  First,  it  hands  the  compiler  extensive  type 
checking  capabilities.  Accordingly,  operations  on  data 
types  which  have  no  logical  relation  between  them  are 
disallowed.  Secondly,  user  defined  data  types  imply  very 
little  machine  dependency,  since  it  is  the  responsibility  of 
the  compiler  to  map  the  data  type  onto  the  object  machine. 

Strictly  numeric  types  are  defined  to  be  of  type 
(<number>  to  <number>)  where  both  the  lower  bound  and  the 
upper  bound  of  the  numeric  type  are  explicitly  specified. 
Once  again,  it  is  the  responsibility  of  the  compiler  to  map 
the  numeric  type  onto  the  machine.  ?or  any  implementation, 
there  must  exist  a  machine  enforced  upper  and  lower  limit  on 
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the  values  of  numeric  types.  However,  these  limits  are  not 
requirements  of  the  STTF  System  Language,  but  of  the  object 
machine . 

1 . 4.  1 . 3  Control  Structures 

The  control  structures  of  the  SUE  System  Language  were 
designed  with  the  ease  of  understandability  and 

implementation  in  mind.  The  control  structures  allow 
repetition  as  provided  by  the  cycle  and  do  constructs,  and 
selection  as  provided  by  the  case  and  if  constructs.  No 
goto  construct  is  available  in  the  language. 

The  cycle  construct  provides  the  capability  of  an 
unbounded  loop  with  termination  of  the  loop  provided  by  the 
exit  construct.  Partial  execution  of  a  loop  is  provided  by 
the  repeat  construct  which  causes  a  transfer  to  the  head  of 
the  loop.  For  example: 

declare 

(  0  to  3)  (I)  ; 

T  :  =  0  ; 

cycle 

I  :  =  I  +  1 ; 
exit  when  1=3 

end 
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Bounded  repetition  is  provided  hy  the  do  construct  in 
which  a  control  variable  assumes  a  sequence  of  values  during 
the  repeated  execution  of  a  statement  list.  For  example: 

do  I  : =  1  to  3; 

end 

Selection  is  provided  by  the  case  construct.  The  case 
construct  selects  a  particular  statement  list  from  an  array 
of  statement  lists  for  execution.  The  alternative  statement 
lists  are  labelled  by  constants  of  a  data  type  specified  in 
the  header  of  the  case  construct,  and  selection  is  based  on 
an  expression  of  this  so  called  tag  type.  The  if  construct 
is  a  specialized  form  of  the  case  construct  in  which 
selection  is  based  on  a  Boolean  expression.  For  example: 

declare 

Colour  (Selector)  ; 

Selector  :=  Blue; 

case  Colour  tag  Selector; 

Fed:  T  :=  1; 

Green:  T  :=  2; 

Blue:  I  :=  3 

end 
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1 • 5  &n  Implementation  Language  For  Minicomputers 


Fecognizing  the  need  for  a  systems  language  for 
minicomputers,  we  felt  that  the  SUE  System  Language  would 
serve  our  purposes  for  the  following  reasons: 

1)  The  SUE  System  Language  is  sufficiently  powerful  and 
flexible  to  provide  systems  software  for  a  reasonably 
large  class  of  minicomputers. 

2)  Nost  of  the  data  structures  of  the  SUE  System  Language 
are  machine  independent. 

3)  There  existed  a  compiler  for  the  SUE  System  Language  to 

produce  code  for  the  System/360/370  family  of 

computers.  Ue  felt  that  too  much  time  and  effort  would 
have  been  needed  to  implement  a  SUE  compiler  to  run  on 
a  minicomputer.  Consequently,  the  decision  was  made  to 
have  the  SUE  compiler  for  a  minicomputer  execute  on  the 
System/360.  The  general  availability  of  the  System/360 
makes  it  a  good  choice,  and  it  was  felt  that  the 
existing  compiler  could  be  modified  at  a  lower  cost  in 
time  and  manpower  than  to  completely  redesign  the 
compiler. 

To  prove  the  feasability  of  using  the  SUE  System 
Language  as  a  systems  language  for  minicomputers,  a  POP- 11 
compiler  for  the  SUE  language  was  developed.  The  PUP- 1 1  was 
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chosen  for  several  reasons.  It  belongs  to  the 


new 


generation  of  minicomputers.  It  may  be  expanded  from  a 
basic  model  (4K  of  core  and  a  teletype)  up  to  56K  words  and 

a  variety  of  peripheral  equipment.  The  hardware 

architecture  makes  it  a  very  nice  machine  for  the  compiler 

writer,  and  for  the  demonstration  of  operating  system 

principles.  Finally,  the  most  pragmatic  reason  was  the 
availability  of  a  PPP- 11/20  for  our  use.  The  general 
availability  of  the  PnP-11  leads  us  to  believe  that  our 
choice  was  a  good  one. 


The  following  chapters  of  this  thesis  will  discuss  the 
design  considerations  and  the  implementation  of  the  STJF 
System  language  compiler  for  the  PDP-11.  Chapter  2 
describes  the  machine  architecture  of  the  PDP-11,  and  points 
out  those  features  which  are  used  in  the  implementation. 
Chapter  3  describes  the  structure  of  the  SU17  System  Language 
compiler  for  the  PDP-11.  It  also  includes  a  discussion  on 
the  similarities  and  differences  between  the  PDP-11  compiler 
and  the  System/360  compiler.  Chapter  4  describes  the  run 
time  organization  of  the  PDP-11  compiler.  It  includes 


sections 


on 


data  representation,  run  time  storage 


organization,  procedure  linkage,  and  control  structure 
implementation.  Chapter  5  concludes  the  thesis  with  a 
discussion  of  the  success  of  the  PDP-11  compiler  in  terms  of 
the  code  produced,  and  the  software  that  has  been  written 
using  it,  as  well  as  some  thoughts  on  the  practicality  of 
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using  the  SU3!  System  Language  to  produce  software  for  other 
minicomputers. 


!  L 


. 
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2  PDP-11  Machine  Structure 


2.  1  Introduction 


The  SUE- 1 1  compiler  was  designed  to  produce  software 
for  the  PDP-11  family  of  computers,  with  particular  emphasis 
on  the  PDP-11  model  20.  This  chapter  contains  a  description 
of  the  PDP-11  machine,  so  as  to  give  the  reader  a  better 
understanding  of  the  implementation  design.  It  contains 
sections  describing  the  overall  machine  architecture,  a 
brief  description  of  those  instructions  common  to  all  models 
of  the  PDP-11  family,  and  a  description  of  the  stack 
processing  capabilities  of  the  machine.  The  chapter  is 
concluded  with  a  discussion  of  how  the  emphasis  on  the  PDP- 
11/20  effected  the  design. 

The  PDP-11  has  a  16  bit  word  length,  and  is  byte 
addressable.  Byte  addresses  are  numbered  consecutively 
starting  at  0  and  word  addresses  are  restricted  to  be  even, 
or  an  interrupt  will  occur.  Addresses  0-255  are  reserved 
for  the  various  trap  vectors  of  the  machine,  and  should  not 
be  used  as  either  program  or  data  area. 

The  PDP-11/20  is  equipped  with  eight  16  bit  general 
purpose  registers  referred  to  as  (B0,F.  1 , .  .  .  , F7)  .  "Register  7 


serves  as 

the 

instruction 

counter 

of  the 

mach ine; 

that 

is. 

it  contains 

the 

address  of 

the 

next  instruction 

to 

be 

executed. 

Pegister  6  has 

the 

special 

function 

of 

the 

-23- 


processor  stack  pointer  (see  section  2.5)  .  The  general 
purpose  registers  ip  ay  all  be  used  as  accumulators,  index 
registers,  or  stack  pointers. 

The  central  processor  status  register  (PS)  contains 
information  regarding  the  priority  of  the  processor.  The  PS 
is  directly  addressable,  and  it  may  be  set  to  any  one  of 
eight  priority  levels  under  programmer  control.  Processor 
priority  is  held  in  bits  5,6,  and  7  of  the  PS.  The  result 
of  the  execution  of  most  instructions  causes  bits  0, 1,2,3 
(referred  to  as  the  C,V,Z,N  bits)  to  be  set  or  cleared.  The 
bits  are  set  as  follows: 
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1  not  used 

priority 

T  N  Z  V  C 

.11. 

.  1  1  1  1  

Figure  2.1-1  Processor  Status  Register 


2 . 2  Addressing 

All  addressing  in  the  PDP-11  is  done  through  the 
general  purpose  registers.  A  PDP^II  instruction  word 
contains  addresses  consisting  of  six  bits,  divided  into  two 
three  bit  parts.  The  first  three  bit  part  indicates  the 
address  mode  to  be  used,  and  the  last  three  bits  indicate 
the  register  through  which  the  address  calculation  is  to  be 
made  (see  fig.  2.2-1). 


mode 

register 

!  1 

1  I 

Figure  2.2-1  PDP- 1 1  Address 
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2.2.1  Addressina  Modes 


In  order  to  provide  a  more  powerful  instruction  set, 
the  PDP-11  allows  tremendous  flexibility  in  addressing 
capabilities:  the  PDP-11  has  8  modes  of  addressing.  The 
following  table  summarizes  the  addressing  modes  of  the  PDP-* 

1  1. 


Mode  Assembler 
format 


Descriotion 


0  OPF.  Rn  General  register  mode.  The  operand  is 

contained  in  register  Rn. 

1  OPR  (Pn)  Deferred  addressing  mode.  The  address  of 

the  operand  is  contained  in  register  Pn. 


2  OPR  (Pn) +  Autoincrement  addressing  mode.  The 

address  of  the  operand  is  taken  from  Rn 
and  then  the  register  is  incremented  by 
one  or  two  depending  on  the  operation. 


3  OPR  5)  (Rn)  +  Deferred  autoincrement  addressing.  The 

contents  of. the  address  contained  in  Rn 
is  used  as  the  address  of  the  operand, 
and  then  Rn  is  incremented  by  one  or  two 
depending  or.  the  operation. 
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OPR  -  (Pn)  T.utod ecremen t  addressing  mode.  The 

contents  of  Pn  is  decremented  by  one  or 
two  depending  on  the  operation,  and  the 
new  contents  of  Rn  is  used  as  the 
operand  address. 


OPP  a)-  (Pn)  Deferred,  autodecrement  addressing.  The 

contents  of  Rn  is  decremented  by  one  or 
two  depending  on  the  operation,  and  the 
new  contents  is  used  as  the  address 
containing  the  operand  address. 


OPR  X  (Rn)  Indexed  addressing  mode.  The  register  Rn 

is  used  as  an  index  register.  The 
operand  address  consists  of  the  sum  of 
the  contents  of  ^n,  and  an  index  word 
X.  The  index  word  directly  follows  the 
instruction  word. 


7 


OPR  SX(?n)  Deferred  indexed  addressing  mode.  The 


calculated  address  (contents  of  Rn  plus 
the  index  X)  is  used  as  the  address  of 
the  memory  location  containing  the 


address  of  the  operand. 
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The  use  of  register  7  (program  counter)  as  the 
addressing  register  results  in  special  modes. 

Mode  2  with  reaister  7  implies  that  the  word  following 
the  instruction  contains  the  operand.  Constants  may  be 
generated  by  the  use  of  this  mode  (OPR  #n) . 

Mode  3  with  register  7  implies  that  the  word  following 
the  instruction  contains  the  address  of  the  operand. 
Absolute  addresses  may  be  formed  in  this  way  (OPR  . 

Mode  6  with  register  7  specifies  the  operand  address 
relative  to  the  instruction  counter.  If  the  operand  is 
located  at  machine  address  A,  the  index  word  following  the 
instruction  should  contain  (A-address  of  index  word-2) , 
since  fetch  of  the  index  word  causes  the  instruction  counter 
to  be  incremented  by  2  (OPR  A)  . 

Instructions  for  both  byte  and  word  address 
manipulations  are  available.  The  byte  addressing  scheme  of 
the  PPP-11  is  worthy  of  mention.  Consecutive  byte  addresses 
do  not  access  contiguous  bytes.  If  N  is  a  word  address, 
then  the  use  of  a  byte  instruction  to  reference  N,  results 
in  access  to  the  low  order  byte  of  the  word  at  N.  The  use 
of  a  byte  instruction  to  reference  N+ 1 ,  results  in  access  to 
the  the  high  order  byte  of  the  word  at  N.  Thus  the 
application  of  a  byte  operation  to  a  register,  causes  the 
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low  order  byte  of  the  register  to  be  involved  in  the 
operation . 


The  use  of  byte  instruct 
autodecrement  addressing  inodes  ca 
to  be  stepped  by  1  to  point  at 
use  of  word  instructions,  causes 
by  2 . 


ions  with  autoincrement  or 
uses  the  specified  register 
the  next  byte  of  data.  The 
the  register  to  be  stepped 


2.3  Instruction  formats 


The  basic  i 
consisting  of  an 
followed  by  one 
however,  have  var 
type.  The  follow 
the  instruction 
implementation  de 


nstruction  length  of  the  PDP-11  is  16  bits, 
opcode,  and  address  modes,  and  may  be 
or  two  index  words.  The  operation  codes 
iable  lengths  depending  on  the  instruction 
ing  section  contains  a  brief  description  of 
formats  occurring  frequently  in  the 
script  ion. 


2.3.1  Double  Operand 


A  double  operand  instruction  consists  of  an  operation 
code,  a  source  address,  and  a  destination  address  (^ig. 

2.  3.  1-1)  . 
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opcode 

dst  address 

src  address 

111. 

.  i  I  1  1  1  . 

.11111.... 

figure  2. 3. 1-1  Double  Operand  Instruction  format 


Source  address  calculations  precede  destination  address 
calculations,  and  as  a  result,  if  indexed  mode  is  used  for 
both  addresses,  the  index  word  for  the  source  address 
precedes  the  index  of  the  destination.  The  double  operand 
instructions  which  operate  on  word  data  are  the  following: 


MOV  src,  dst 


Move  the  source  operand  to  the  destination 
address. 


ADD  src,  dst  Add  the  source  operand  to  the  destination 

operand,  and  store  the  result  at  the 
destination  address. 


SUB  src,  dst 


Subtract  the  source  operand  from  the 
destination  operand  and  store  the  result  at 
the  destination  address. 


CMP  src,  dst 


Subtract  the  destination  crerand  from  the 

- - - ^ 

source  operand  and  set  the  condition  codes. 
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according  to  whether  the  result  is  less 
than  zero,  greater  than  zero,  or  equal  to 
zero.  Both  the  destination  and  the  source 
operands  are  left  unchanged. 


BIS  src,  dst  Perform  "inclusive  or”  between  the  source 

and  destination  operands,  and  store  the 
result  at  the  destination  address. 


3IC  src,  dst  Fach  bit  in  the  destination  that 

corresponds  to  a  set  bit  in  the  source,  is 
cleared . 


BIT  src,  dst 


Perform  logical  "and”  comparison  of  the 
source  and  destination,  and  set  the 
condition  codes  accordingly.  ^oth  operands 
remain  unchanged. 


Analogous  double  operand  byte  instructions  are 
available  for  all  operations  except  add  and  subtract.  Thus, 
all  arithmetic  must  be  applied  to  fullword  quantities.  The 
conversion  of  byte  quantities  to  fullword  (Bit (16))  is 
accomplished  by  the  use  of  the  move  byte  (I^OVE)  instruction. 
If  the  destination  address  of  the  MOVB  instruction  is  one  of 
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2.3.2  Single  Operand 

A  single  operand  instruction  consists  of  an  operation 
code  followed  by  an  address. 


opcode 

dst  address 

LI  1  Li  Li  1  1  . 

.  1  _ 1_  1  1 

Figure  2. 3. 2-1  Single  Operand  Instruction  Format 


Byte  and  word  instructions  exist  for  zeroing,  incrementing, 
decrementing,  negating,  etc.  any  addressable  machine 
location  [Digital  1 96  Q  1  - 


2.3.3  Branch  Instructions 

A  branch  instruction  consists  of  an  operation  code 
followed  by  an  eight  bit  offset.  The  offset  is  treated  as  a 
signed  two's  complement  displacement  to  be  multiplied  by  two 
and  added  to  the  program  counter.  Since  the  program  counter 
points  to  the  word  following  the  branch  instruction,  the 
effect  is  to  cause  the  next  instruction  to  be  taken  from  an 
address  located  up  to  127  words  back,  or  128  words  ahead  of 
the  branch  instruction. 
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operation  code 


offset 


Figure  2.3. 3-1  Pranch  Instruction  Fo r m a t 


A  more  flexibl 
instruction.  The  form 
to  that  of  a  singl 
the  instruction  JMP  d 
destination  address, 
register  mode  may  be 


e  branch  is  provided  by  the  jump  (JMP) 
at  of  the  jump  instruction  is  identical 
e  operand  instruction.  On  execution  of 
str  the  program  counter  is  set  to  the 
All  addressing  modes  except  general 
used  in  conjunction  with  a  jump. 


2.U  Arithmetic 


All  arithmetic  on  the  PD*5- 1  1  is  done  in  t* 
complement  form,  and  both  words  and  bytes  have  a  sign  in 
high  order  bit.  Since  both  source  and  destination  addres 
may  refer  to  any  machine  address,  arithmetic  operations 
be  applied  to  any  addressable  memory  location.  The  gene 
purpose  registers  are  thus  free  to  act  as  fast  accumulato 
stack  pointers,  and  temporaries. 
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2 . ^  Stack  Processing 


Stack  processing  capabilities  are  provided  to  users  of 
the  PDP-11  by  the  autodecrement  and  autoincrement  addressing 
modes.  The  autodecrement  mode  used  with  register  Rn,  causes 
the  contents  of  Rn  to  be  decremented,  and  the  new  contents 
cf  Rn  are  used  as  the  onerand  address.  ^hus,  if  Pn  contains 
the  address  of  the  top  of  a  stack,  the  execution  of  the 
instruction  MOV  R0,-(Pn)  would  cause  the  stack  to  be 
pushed,  and  the  contents  of  PO  would  become  the  top  element 
of  the  stack. 


Conversely,  aut 
as  the  operand  addres 
instruction  MOV  (Rn) 
stack  into  RO,  and  th 
grow  toward  address  0 


oincrement  mode  uses  the  contents  of  Rn 
s,  a nd  then  pn  is  incremented .  The 
+,s0  would  place  the  top  element  of  the 
en  pop  the  stack.  Stacks,  in  general. 


As  mentioned  in  section  2.1,  register  6  has  the  special 
^unction  of  processor  stack  pointer.  In  the  PDP-11,  a  stack 
is  automatically  maintained  by  the  hardware  for  interrupt 
processing.  This  stack  is  addressed  through  register  6. 
Whenever  a  processor  interrupt  occurs,  the  processor  pushes, 
first  the  central  processor  status  (PS)  and  then  the 
instruction  counter  (PC)  onto  the  processor  stac1'.  The  new 
PS  and  PC  are  taken  from  a  memory  location  dependent  on  the 
interrupt. 
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Since  register  6  is  thus  used  for  interrupt  processing, 
and  register  7  as  the  instruction  counter,  auto  in crement  and 
autodecrement  modes  using  the  tvo  registers  are  always  done 
in  steps  of  two.  Pyte  operations  on  these  two  registers 
simply  leave  odd  addresses  unmodified. 

2 . 6  Conclusions 

In  the  design  of  the  SUP  compiler  for  the  PDP-11,  we 
intended  that  programs  Droduced  by  the  compiler  could  run  on 
any  machine  in  the  PDP-11  family.  Consequently ,  our 
implementation  has  used  only  that  subset  o^  the  instruction 
set  common  to  all  pnn-'ji  machines.  This  has  resulted  in 
several  shortcominas  in  our  implementation.  •  Multiply, 
divide,  and  exclusive  or,  for  instance,  are  not  supported  by 
our  implementation  since  they  are  available  as  standard 
hardware  instructions  only  on  the  more  sophisticated  models 
of  the  PDP-11  family.  In  a  similar  vein,  the  group  data 
structure  of  the  SUP  language  was  not  implemented  due  to  the 
lack  of  a  multiple  shift  instruction. 

We  feel  that  these  restrictions  have  not  greatly 
effected  the  scope  of  the  use  of  the  compiler.  If  the  extra 
hardware  is  available,  the  above  features  can  be  implemented 
in  a  short  period  of  time,  since  the  tools  needed  \  to 
implement  them  are  already  available  in  the  compiler. 
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3 .  Compiler  Structure 
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3.1  Introduction 


The  System/360  compiler  for  the  SUP  System  Language 
(STn?-360)  was  written  using  the  XPL  translator  writing 
system  f McKeeman  et  al  1970],  and  the  PPL  language  [Rudmik 
1972],  and  using  the  LALS  technique  [Lalonde  1971]  to 
generate  the  parsing  tables  for  the  SOP  grammar.  (PPL  is  an 
extended  version  of  XPL  which  allows  controlled  storage, 
records,  pointers,  and  includes  extensive  optimizations)  . 
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The  System/360  compiler  (50^-360)  can  be  divided  into 
distinct  sections.  The  first  section,  usually  referred 
s  the  front  end  of  a  compiler,  handles  such  functions  as 
ning,  paragraphina  source  text,  macro  expansion,  symbol 
e  routines,  tyre  checking,  and  parsing.  The  second 
ion,  usually  referred  to  as  the  code  emitter,  is 
onsible  for  producing  object  code  for  the  target 
ine.  The  code  emission  routines  are  machine  dependent: 
is,  their  structure  and  the  algorithms  used  depend  on 
object  machine.  The  front  end  is  only  lanauage 
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dependent,  its  structure  determined  by  the  source  language 
to  compile. 

In  order  to  generate  a  SUE  compiler  to  produce  code  for 
the  PDP-11  (SUE- 11)  in  a  reasonable  length  of  time,  we 
decided  to  use  as  much  of  the  existing  SUE-360  compiler  as 
possible.  Extensive  modification  of  any  ’’large1’  program  by 
someone  other  than  the  oriainal  programmer  always  involves 
an  initial  overhead,  namely  the  time  required  to  understand 
the  existing  program  logic.  In  our  case,  close  to  50%  of 
our  initial  design  period  was  spent  in  becoming  familiar 
with  the  existing  compiler.  If  the  program  is  well  written 
in  a  high  level  language,  this  initial  overhead  can  be  small 
in  comparison  to  the  time  required  to  redesign  usable 
sections  from  scratch. 


The  structure  of  the  SU^-II  compiler  is  essentia  lly  the 
same  as  that  of  the  original  SUE^360  version.  The  front  end 
of  the  compiler  is  in  fact  almost  identical.  The  scanner, 
the  paragrapher,  the  parser,  the  macro  expander,  and  the 
type  checker  are  left  intact.  The  symbol  table  routines 
were  changed  only  to  accomodate  differences  in  the 
addressing  structure  of  the  PDP-11  implementation.  A 
description  of  the  original  routines,  and  the  flow  of 
information  between  them,  may  be  found  in  [Clark  1971], 


In  considering  an  organization  for  the  code  emission 
section  of  the  compiler,  we  came  to  the  conclusion  that  the 
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existing  overall  structure  of  the  SUP-360  code  emitters  were 
to  a  large  extent  machine  independent.  We  thus  decided  to 
adopt  their  structure  as  much  as  possible.  This  has 
resulted  in  several  advantages.  The  implementation  of  the 
SUP-11  code  emitters  progressed  at  a  much  faster  rate  due  to 
the  similarity  of  structure  to  the  SUP-360  compiler.  During 
our  implementation,  the  close  scrutiny  of  the  SUP-360 
compiler  uncovered  some  existing  errors  which  might  not  have 
become  obvious  for  some  length  of  time.  In  a  similar 
fashion,  problems  in  our  implementation  had  often  been 
Dreviously  overcome  in  the  SUP-360  implementation.  The 
final  external  structure  of  the  code  emission  routines  for 
both  compilers  are  similar  to  the  extent  that  we  feel  that 
both  compilers  can  be  maintained  by  a  person  familiar  with 
one  or  other  of  the  compilers,  and  with  both  machines. 

4.  W 

3. 2  The  Structure  of  the  Code  Pmission  Poutines 

The  organization  of  the  code  emitter,  and  the  flow  of 
information  between  the  various  routines  comprising  the  code 
emitter,  are  outlined  in  ^igure  3.2-1.  The  Followina 
paragraphs  describe  the  function  of  each  major  module. 

The  control  program  is  driven  by  the  syntactic 
productions  recognized  by  the  parser.  Whenever  a  syntactic 
entity  is  recognized,  the  control  program  updates  compile 
time  data  structures,  and  where  necessary,  decides  on  the 
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figure  3.2-1  Code  Fmitter  Organization 


type  of  code  to  be  emitted.  The  control  program  calls  the 
module  responsible  for  the  code  emission  of  the  class  of 
productions  to  which  the  recognized  production  belongs.  The 
compiler  has  extensive  type  checking  capabilities,  and  it  is 
the  responsibility  of  the  control  program  to  initiate  type 
checking  where  necessary. 

Whenever  a  declaration  statement  is  recognized,  the 
control  program  calls  the  declaration  processor.  The 
declaration  processor  is  responsible  for  the  calculation  of 
machine  addresses  for  variables,  and  for  the  mapping  of  data 
types  onto  the  machine.  The  declaration  processor  decides 
whether  the  variable  may  be  handled  as  an  arithmetic 
quantity  by  the  available  PDP-11  instruction  set.  If  not, 
the  variable  is  flagged  to  indicate  to  the  compiler  that  the 
variable  will  have  to  be  handled  in  a  different  manner. 

Whenever  an  arithmetic  expression  is  recognized,  the 
control  program  activates  the  expression  processor.  The 
function  of  the  arithmetic  processor  is  twofold.  It 
prepares  the  operands  of  the  expression  for  processincr  by 
loading  their  values  as  required.  By  the  time  the  operator 
of  the  expression  is  recognized,  it  is  guaranteed  that  the 
operands  of  the  expression  are  ready  for  the  operation,  and 
the  arithmetic  processor  simply  generates  code  which  yields 
the  value  of  the  expression. 
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An  alternative  method  could  have  been  used. 


This 


method  builds  an  expression  tree,  and  then  walks  the  tree, 
emitting  code  along  the  way.  Better  code  seguences  could 
have  been  produced  by  this  method  for  certain  expressions  at 
the  cost  of  maintaining  the  tree.  We  do  not  expect  the  SUE 
language  to  be  used  frequently  for  complicated  numeric 
operations,  and  thus  the  simplicity  of  the  scheme  used 
outweighs  its  disadvantages. 

The  appearance  of  a  procedure  name  causes  control  to  be 
passed  to  the  procedure  processor.  The  function  of  the 
procedure  processor  is  twofold.  First,  the  procedure 
processor  is  responsible  for  producing  the  prologue  and 
epilogue  to  procedures,  whenever  a  procedure  definition  is 
recognised.  Secondly,  the  occurrence  of  a  procedure  name  in 
an  expression  causes  the  procedure  processor  to  emit  co^e 
for  linking  to  the  procedure.  This  involves  the  setting  of 
base  address  pointers,  and  the  reserving  of  space  for,  and 
the  type  checking  of  the  procedure  parameters. 


The  processing  of  poversets  is  sufficiently  different 
from  the  processing  of  the  usual  arithmetic  expressions,  to 
require  special  routines.  ^bus,  whenever  a  powerset 
operation  is  reguired,  control  is  passed  to  the  powerset 
processor.  It.  is  responsible  for  the  code  generation  of 

for  the  implicit  conversion  of 
type  as  required. 


powerset  operations,  an^ 
scalar  types  to  powerset 
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The  assignment  processor  generates  code  for  the 
assignment  operator.  The  processing  of  the  assignment 
operator  involves  several  functions.  Since  powerset  types 
may  also  be  involved  in  assignment,  the  assignment  Drocessor 
type  checks  whether  the  poverset  assignment  is  legal,  and  if 
necessary,  converts  scalar  types  to  powerset  types, 
"finally,  the  assignment  processor  emits  code  for  the 
assignment . 

The  control  section  processor  handles  the  code 
aeneration  for  all  the  control  structures  of  the  STJW 
language.  Both  selection,  and  exits  -From  a  loop  ar^  managed 
by  the  use  of  compile  time  tables.  These  tables  are  updated 
by  the  control  section  processor,  and  whenever  all  pertinent 
information  is  available,  these  tables  are  used  to  emit 
fixups,  and  branch  tables.  Both  the  management  and  the  use 
of  the  tables  are  machine  independent,  and  were  adopted 
intact  from  the  SU^-lbO  compiler. 

The  object  code  emitter  is  responsible  for  the 
production  of  physical  PDP-11  object  code.  The  produced  code 
is  placed  into  a  code  stack  containing  all  the  code 
generated  for  a  statement,  ^or  readability  of  interlisted 
code,  whenever  the  end  of  a  statement  is  reached,  the  code 
stack  is  popped,  and  the  code  is  dumped  onto  an  output  file. 
F’he  code  stack  provides  the  compiler  with  the  capability  of 
peephole  optimization  within  statements.  A  primitive 


peephole  optimizer,  which  tries  to  optimize  arithmetic 
expressions,  is  include^*  in  the  compiler  (see  section  4.3). 


The  object  code  emitter  produces  its  output  in  a  format 
acceptable  to  the  System/360  linkage  editor.  The  linkage 
editor  acts  as  our  file  system  for  object  programs,  manages 
fixups,  and  inserts  the  entry  point  addresses  for 
procedures.  By  the  use  o4"  the  linkage  editor,  the  patching 
of  branch  addresses  unknown  at  the  time  when  the  branch 
instruction  is  emitted,  and  the  processing  of  external 
references  need  not  be  handled  by  the  compiler.  "^hus,  once 
the  compiler  adds  the  code  for  a  statement  to  the  output 
file,  the  compiler  need  never  again  inspect  that  code. 


The  linkage  editor 
proposition  that  as  much 
available  should  be  used 
is  processed  by  a  specia 
Language  program.  This 
lead  programs  produced  by 
System/3  6  0  machine  tor 
slight  modification  to  th 
object  deck  which  is  then 


processing  was  in  keeping  with  the 
software  as  was  useful  and  alreadv 
.  The  output  of  the  linkage  editor 
ily  written  System/360  Assembler 
program  was  originally  written  to 
the  SUF-360  compiler  onto  the 


execut ion . 

Be 

found 

that  with  a 

is  program. 

we 

could 

oroduce  an 

transferred 

to 

the  PDP- 

11. 

By  using  this  system,  the  output  of  the  SUB- 11  comni.  ler 
is  incom pa table  to  output  produced  by  the  PDP-11  assembler. 
Consequently,  the  loaders  available  for  the  PDP-11  cannot  be 
used  to  load  SUF- 1 1  programs  onto  the  PDP-11.  A  program 
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Figure_3.2-2  Production  o^  a  PDP-11  Load  Module 


written  in  SUF  and  processed  by  the  SUF^II  compiler  (SUF-11 
loader),  loads  any  snr-11  program  onto  the  PDP-11,  relocates 


the  program,  and  then  transfers  control 
in  order  to  run  SUF-11  programs  on  the 
loader  needs  to  be  bootstrapped  onto 
PDP-11  assembler  procrram  may  accomplish 
shows  the  flow  of  information  between 
used  by  this  system. 


to  it.  As  a  result, 
PDP-11,  the  SUF-11 
the  PDP-11.  A  simple 
this.  figure  3.  2-2 
the  various  proarams 


3.3  Conclusions 


As  the  above  sections  indicate,  the  SUF-11  compiler  was 
designed  to  be  as  machine  independent  as  possible.  We  have 
taken  this  premise  to  the  point  of  designing  a  method  of 
linki ng  object  modules  which  does  not  depend  on  the  target 
machine.  The  binding  of  the  object  modules  to  the  target 
machine  is,  in  fact,  handled  by  the  SUF-11  loader.  We  hope 
that  we  have  thus  provided  machine  independent  tools  to  aid 
in  the  quick  implementation  of  SUE  compilers  for  other 
machines . 
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4  Runtime  Organization 

- —  - -T — T- - 

This  chapter  will  describe  the  runtime  organization  of 
the  SUE-11  compiler.  It  contains  sections  describing  the 
data  organization,  the  representation  of  data,  the  procedure 
linkage  sequence,  and  the  implementation  of  control 
structures. 

4.1  Introduction 

^he  design  of  the  SUE- 1 1  compiler  was  always  governed 
by  the  following  two  considerations. 

1)  A  working  compiler  was  to  be  produced  in  a  short  period 
of  time  (less  than  six  months)  . 

2)  The  compiler  should  produce  efficient  code.  Efficiency 
in  our  case  refers  primarily  to  efficiency  in  space, 
not  time.  That  is,  whenever  a  decision  had  to  be  made 
whether  to  produce  code  that  will  execute  faster  in  a 
handful  of  cases,  or  to  produce  a  small  amount  of  code 
for  the  majority  of  cases,  the  space  consideration  was 
chosen.  Space  and  time  considerations,  however, 
frequently  coincide. 

At  all  times,  the  fact  that  the  SUF-11  compiler  was 
meant  to  produce  systems  software,  including  operating 
systems,  was  kept  in  mind.  Accordingly,  code  sequences 
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which  might  cause  relevant  data  to  be  destroyed  at  the 
occurrence  of  a  hardware  interrupt,  were  discarded. 

Ve  felt  that  no  assumptions  regarding  the  load  point  of 
a  program  should  be  made.  As  a  result,  all  cede  produced  by 
the  SUE-11  compiler  is  position  independent  (relocatable), 
fortunately,  position  independent  code  can  be  produced  for 
the  PDP-11  with  only  a  slight  loss  in  execution  speed,  by 
indirect  addressing  and  the  use  of  register  7  (instruction 
counter)  as  an  index  register.  Position  independence  gives 
us  the  capability  of  overlaying  program  segments,  probably 
an  essential  requirement  for  programs  on  a  minicomputer. 

Tn  the  following  sections,  we  will  refer  to  the  example 
in  figure  4.1—1  in  our  discussion  oF  the  runtime 
organization.  It  shows  the  logical  structure  of  an  example 
set  of  procedures  comprisincr  a  SUE  program.  Each  rode  in  the 
diagram  represents  both  the  program  and  the  data  blocks  oE  a 
procedure . 
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Figure  4.1-1  A  family  of  Procedures 


4 . 2  Funtime  Storage  Organization 


4.2.1  Introduction 


In  a  similar  vein  to  position  independent  code,  we  felt 
that  data  should  not  reside  at  absolute  machine  addresses 
(unless  explicitly  specified  by  the  programmer).  We  also 
felt  that  procedures  should  he  allowed  to  call  one  another 
recursively,  implying  that  several  copies  of  the  local  data 
for  a  procedure  may  be  required  at  any  point  in  execution, 
doth  requirements  are  met  by  the  use  of  a  data  stack. 

As  mentioned  in  section  2.5,  the  PDP-11  provides  each 
o^  the  general  purpose  registers  with  the  capability  of  a 
stack  pointer,  by  the  use  of  the  autoincrement  and 
autodecrement  addressincr  modes.  Pegister  6  is  used  by  the 
PDP-11  hardware  as  the  processor  stack  pointer.  That  is,  a 
machine  interrupt  causes  the  central  processor  status 
register  and  the  instruction  counter  to  be  pushed  onto  the 
stack  pointed  at  by  register  6.  Since  register  6  was 
already  used  by  the  hardware  as  a  stack  pointer,  we  decided 
to  use  register  6  as  our  data  stack  pointer  (referred  to 
from  now  on  as  PS,  for  run  stack  pointer) .  If  register  6  is 
never  allowed  to  point  below  relevant  data,  the  occurrence 
of  an  interrupt  will  leave  all  data  intact.  Thus,  by  the 
use  of  register  6  as  the  data  stack  pointer,  we  ensure  that 
the  programmer  need  never  worry  about  the  security  of 
nroaram  data. 

x. 
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The  addressina  scheme  of  the  SUE- 11  implementation 
closely  resembles  that  used  by  the  PASCAL  compiler  for  the 
CUC  6000  family  of  computers  [WTFTH  1971b]. 

Fvery  procedure  is  considered  to  have  associated  with 
it  a  data  segment,  consisting  of  a  header,  and  the  local 
data  space  of  the  procedure. 

The  data  segments  of  procedures  are  linked  together  in 


the  dat 

a  stack 

by  two  chains. 

referred 

to 

as 

the 

d  vna  m ic 

link  and 

the  static  link 

respective! v 

Th 

e  hea 

der 

of  the  d 

ata  segme 

fit  contains,  among 

other  inf or mat i 

on. 

two 

pointers 

:  one 

associated  with 

the  dynamic  . 

link 

(d  yna 

mic 

pointer) 

,  and  th 

e  other  with  th 

e  static  1 

ink 

(stat 

ic 

pointer) 

.  The 

dynamic  pointers  1 

ink  together 

all 

exist 

ing 

data  seg 

ments  in 

the  order  of  their 

generation ; 

the 

st  a 

tic 

pointers 

link  t 

ogether  only  those 

data  seginen 

ts  accessa 

ble 

from  the 

currentl 

y  executing  proced 

ure.  See  Fig 

ure 

a. 2. 2 

-  1. 

All  local  data  in  a.  data 
of  address  couples 
displa  cement 
number  of  the 
are  allocated 

The 


segment  are  addressed  by  means 
by  a  base  address,  and  a 
s  referred  to  as  the  order 
segments  of  a  procedure 
allow  recursive  calls  on  the 
achieved  by  a 


formed 
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variable.  The  data 
dynamically,  to 
allocation  may  be 


procedure . 


push  of 
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data  stack  of  the  number  of  bytes  required  tc  accomodate  the 
Drocedure . 


The  header  of  a  procedure  is  located  at  the  bottom  of  a 
data  segment.  The  base  address  of  the  data  segment  is 
considered  to  be  the  address  of  the  static  pointer  located 
in  the  header,  and  all  displacements  for  local  variables 
(or^er  numbers)  are  calculated  from  this  address.  A  close 
scrutiny  of  the  autodecrement  and  autoincrement  addressing 
modes  will  show  that  stacks  grow  dynamically  from  high  core* 
addresses  towards  low  core.  As  a  result,  the  order  numbers 
of  variables  are  negative.  In  order  to  compute  the  address 
of  a  variable  contained  in  a  data  segment,  the  order  number 
of  the  variable  is  added  to  the  containing  data  segment’s 
base  address. 


The  PASCAL  implementation  report  suggests  that  the  base 
addresses  of  all  active  data  segments  (those  that  may  be 
accessed  from  the  presently  executing  procedure)  be  stored 
in  a  display.  The  display  would  then  be  contained  in 
registers  for  quick  access.  Unfortunately  for  our  purposes, 
the  PUP-11  only  contains  8  general  purpose  registers,  of 
which  registers  6  and  7  are  used  by  the  hardware.  Registers 
are  also  required  by  the  implementation  for  some  arithmetic 
operations,  and  for  temporary  locations  to  allow  us  to 
generate  a  smaller  amount  of  code  for  certain  constructs. 
The  use  of  registers  to  store  the  display  would  have 
restricted  the  nesting  of  procedures  to  some  small  number,  a 
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restriction  unacceptable  for  our  purposes.  A  scheme  used  in 
the  Burroughs  5C00  ALGOL  implementation  [burroughs  1^69]. 
was  decided  upon. 

Two  pointers  referred  to  as  the  DO  and  the  D1  pointers 
in  the  following  sections  (contained  in  R4  and  P5 
respectively),  are  used  to  provide  addressability  to  all 
active  variables.  The  DO  pointer  is  always  set  to  the  data 
segment  of  the  most  global  (level  0)  procedure,  and  the  PI 
pointer  is  set  to  the  data  segment  of  the  most  recently 
activated  procedure.  In  the  majority  of  data  accesses,  the 
variable  will  be  addressable  through  these  two  pointers. 
the  access  is  to  a  variable  located  in  the  data  segment  of  a 
procedure  nested  between  the  global  procedure  and  the  most 
local  procedure,  the  static_link  allows  us  to.  find  the  base 
address  of  the  data  segment  containing  the  variable.  In 
order  to  facilitate  the  quick  pointer  "chasing”  necessitated 
bv  this  scheme,  the  base  address  of'  a  data  segment 
(contained  in  a  register)  is  always  the  address  of  the 
contained  static  pointer.  Consequently,  with  a  single  PDP- 
11  instruction,  the  static  link  can  be  followed  one  level. 

4.2.3  Examples  of  Data  Access 

We  refer  to  figure  4.2. 2-1  for  an  example  of  the  data 
stack  layout  after  the  calling  sequence  A->R->D->r. 
Procedure  A  is  the  global  procedure  and  has  no  return  value. 
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or  parameters.  Procedure  B  has  been  called  with  parameters, 
and  a  return  value  is  expected.  Procedure  D  has  been  called 
with  parameters  and  no  return  value  is  expected.  Procedure 
T  has  been  called  without  parameters,  and  has  no  return 
value.  The  declarations  for  the  procedures  have  produced 
the  following  storage  assignments. 

Variable  Containing  Static  Order  Type 

Name  Procedure  Nesting  Number 

Level 

Global  A  0  -8'  bit  (16) 

Switch  P.  1  -10  bit  (P) 

Value  T)  1  -24  bit  (16) 

Local  T  2  -4  bit  (16) 


The  code  required  to  push  each  of  the  above  variables 
onto  the  top  of  the  data  stack  (pointed  at  by  PS)  is  the 
following . 

To  push  Global  :  NOV  -8(00), -(PS) 

To  push  Local  :  NOV  -4(01),- (PS) 
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Value  is  not  addressable  direcly  either  through  ^0  or  Dl. 
Therefore  first  follow  the  static  link  to  the  data  segment 


of  the  procedure  containing  Value.  The  pointer  Pi,  point 
to  the  start  of  the  static  link,  namely  the  static  pointer 
of  procedure  1?. 

To  push  Value  :  tfOV  (PI) ,?2  /*  follow  link 

one  level. 
wOV  -24  (P.2)  ,  -  (*S) 

A  worthwhile  optimization  would  he  to  remember  the  contents 
of  P2 . 

Switch  is  a  byte  variable,  but  is  not  addressable  ^rom 
procedure  T‘. 

4.3  rxpression  Evaluation 

The  format  of  arithmetic  operations  on  some  machines 
insists  that  at  least  one  of  the  operands  of  the  expression 
be  located  in  a  register.  The  evaluation  of  expressions 
often  result  in  values  which  must  be  stored  temporarily. 
These  values  are  often  stored  in  registers,  but  if  no 
registers  are  available  and  an  arithmetic  operation  must  be 
performed,  the  contents  of  one  of  the  registers  must  be 
stored  in  a  temporary  memory  location.  A  great  deal  of 
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effort  is  spent  in  sore  compilers  to  allocate  registers 
ef  f ic ient ly. 

The  PDP-11  instruction  set  allows  us  to  handle 
expression  evaluation  in  another  fashion.  The  operands  of 
all  PDP-11  instructions,  including  those  performing 
arithmetic,  may  reside  in  any  addressable  memory  location. 
We  make  use  of  this  facility,  and  evaluate  all  expressions 
on  top  of  the  data  stack. 

Since  data  segments  are  allocated  dynamically  by 
pushing  the  data  stack  the  amount  required  to  hold  the  data 
segment,  we  car.  use  the  top  of  the  data  stack  as  temporarv 
memory  locations.  We  allocate  these  temporary  locations 
dynamically  at  run  time  by  a  push  of  the  data  stack.  As 
soon  as  the  temporary  is  no  longer  needed,  its  value  is 
ooDped  from  the  data  stac>,  and  thus  the  stack  is  reset  +o 
the  position  before  the  allocation.  Py  this  scheme, 
temporary  locations  are  only  used  as  long  as  their  values 
are  needed.  If  a  procedure  call  occurs  while  temporaries 
still  reside  on  the  data  stack,  the  data  segment  of  the 
called  procedure  will  be  allocated  above  any  temoorar ies 
residing  there. 

To  evaluate  an  arithmetic  expression,  the  first  operand 
is  pushed  onto  the  data  stack,  the  second  operand  is  pushed 
onto  the  stack,  and  t^en  the  operation  is  applied  to  the  top 
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two  elements  of  the  s+ack,  popping  both  of  their  values  and 
replacing  them  with  He  result  of  the  operation. 

MOV  operand  1 ,  -  (PS) 

MOV  operan  d2  ,- (PS) 

OP  (PS)-*-,  ( F  5 ) 

^he  compiler  optimizes  the  above  code  sequence  to: 

M0V  operand  1 (?S) 

OP  operand? , (PS) 

4 . 4  nata  Representation 

We  now  discuss  the  internal  representation  of  the  SP^ 
data  types,  and  the  way  in  which  operations  on  them  are 
implemented.  One  of  the  deficiencies  of  the  PDP-11 
instruction  set  is  the  lack  of  facility  whereby  structured 
data  types  may  be  easily  manipulated.  These  deficiencies 
fall  into  two  categories. 

1)  The  PDP-11  instruction  set  allows  the  manipulation  of 
only  eight  bit  bytes,  or  sixteen  bit  words  with  one 
instruction.  The  SITE  system  language  however  allows 
the  manipulation  of  structured  types  (arrays,  records) 
which  in  the  majority  of  cases  are  greater  than  sixteen 
bits  in  length.  No  problem  arises  as  long  as  only 
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2) 


The  PDP-11  allows  address  calculations  to  occur  only 
through  one  register  at  a  time.  Unless  absolute 
addresses  are  to  be  used,  registers  may  not  be 
conveniently  used  for  indexing  to  an  array  element. 
Vhen  dealing  with  data  types  which  involve  either 
implicitly  or  explicitly  a  variable  offset  from  the 
base  address  of  a  variable  of  the  type  (array  s)  ,  the  use 
of  an  index  register  would  be  most  useful. 


Numeric  Types 


Numeric  types  in  the  SUF- 1 1  implementation  are  those 
auantities  whose  internal  representation  is  either  an  eight 
bit  byte,  or  a  sixteen  bit  word.  A  byte  variable  may  have 
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values  in  the  range  -123  to  127,  since  the  high  order  bit  of 
the  byte  is  a  sign  bit. 

All  numeric  operations  allowed  by  the  SUP  System 
Language,  except  multiplication,  division  and  modulo,  are 
implemented.  These  operations  were  not  included  on  account 
of  the  lack  of  a  hardware  multiply  and  divide  on  many  PPP- 
11’s.  As  mentioned  in  section  4.3,  all  arithmetic 
operations  are  evaluated  on  top  of  the  data  stack.  Since  the 
arithmetic  operations  add  and  subtract  can  only  be  applied 
to  fullword  quantities,  all  byte  quantities  are  converted  to 
fullword  before  being  placed  on  the  data  stack.  This  is 
easily  accomplished  by  first  moving  the  byte  quantity  into  a 
register  (section  2.3.1). 

Scalar  Types 

The  values  of  programmer  defined  scalar  types  are 
ordered  by  the  position  of  the  identifier  names  in  the 
defining  list.  Internal  values  are  assigned  in  order 
starting  with  zero,  and  the  base  2  logarithm  of  the  largest 
value  determines  the  number  of  bits  required.  The  symbolic 
constants  cO , c 1 , c2 ,...., cn  of  a  scalar  type,  are  represented 
internally  as  the  integers  0,1,...n. 

£  °  ill  t  X  § 
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The  SU-17  System  Language  allows  the  use  of  pointers  in  a 
restricted  sense,  namelv  by  requiring  the  declaration  of  the 
pointer  to  indicate  the  type  which  results  from  following 
the  pointer  value  (dereferencing).  As  a  result,  the 
compiler  is  able  to  type  check  the  result  of  pointer 
dereferencing,  and  furthermore,  the  compiler  can  determine 
by  context  when  pointer  dereferencing  is  required.  ^or 
example: 


declare 


poin ter  to  bit  (16)  (Address)  ; 


Pointer  values  are  implemented  by  hardware  addresses. 
In  our  implementation,  pointers  are  treated  as  sixteen  bit 
quantities.  Dereferencing  of  pointers  is  implemented  by 
loading  the  pointer  into  one  of  the  available  general 
purpose  registers,  and  then  using  indirect  addressing  to 
load  its  value  onto  the  data  stack. 


Powerset  Types 


A  powerset  type  has  as  its  possible  values  the  set  of 
all  subsets  of  some  base  type,  and  is  partially  ordered  by 
set  conta i nement.  Tor  example: 


declare 


powerset  of  Colour  (hue)  ; 


-P6- 


Tn  the  above  declaration.  Hue  is  declared  to  be  a 
oowerset  variable  which  can  contain  elements  of  the  type 
Colou  r . 

Powerset  types  are  implemented  as  bit  strings,  with  the 
i*th  hit  in  the  bit  string  (counting  ^rom  the  riaht) 
corresponding  to  the  i*th  value  in  the  base  type.  Powerset 
operations  are  implemented  by  combinations  of  logical  "and" 
and  "or"  operations.  Since  we  can  manipulate  only  eight  or 
sixteen  bit  quantities  with  one  instruction,  we  rlace  a 
restriction  on  the  sice  of  powerset s.  A  powerset  mav  contain 
at  most  sixteen  elements. 

Array  Types 

An  array  is  a  structure  consisting  of  a  fixed  number  of 
components,  all  of  the  same  type.  The  dimensionality  of  the 
array  is  specified  in  the  array  declaration. 

In  the  STTP  System  language,  the  bounds  of  arrays  must 
be  constant,  and  are  known  at  compile  time.  As  a  result,  we 
can  allocate  to  a  variable  of  the  array  type  (at  compile 
time)  the  amount  of  storage  required  to  hold  all  the 
elements  of  the  array.  Tn  section  4.2.2,  we  mentioned  that 
the  data  stack  grows  from  high  core  addresses,  toward  low 
core,  and  as  a  result,  the  displacement  of  variable 
addresses  from  the  base  address  of  their  containing  data 
segment,  is  negative.  He  can  also  consider  the  address  of  an 
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array  element  to  consist  of  the  base  address  of  the  array, 
offset  by  the  index  of  the  element  into  the  array.  As  a 
result,  array  indexes  are  subtracted  from  the  base  address 
of  the  array  rather  then  added  as  in  most  implementations. 

We  give  an  example  of  the  code  required  to  access  an 
array  element.  The  declarations  for  the  array  are  shown 
below.  We  assume  that  the  declarations  occurred  in  the  data 
block  of  the  presently  executing  procedure.  ^he  base 
address  of  the  data  segment  containing  the  declared 
variables,  is  thus  contained  in  register  ^1  (section  4.2.2). 

declare 

— *  y 

(0  to  500)  (Index)  , 

array  ((0  to  500))  of  bit  (16)  (^ullword)  ; 

The  above  declaration  reserves  storage  for  a  501  member  16 
bit  array  named  ^ullword,  and  declares  the  variable  Index  to 
take  on  the  values  0-500.  A  word  of  storage  is  thus  reserved 
for  Index.  The  displacement  from  the  base  address  of  the 
containing  data  segment,  of  the  variables  Index  and  Fullword 
are  di,  and  df  respectively.  To  access  the  array  element 
^ullword (Index)  we  emit  the  following  sequence  of  code. 
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/*  ^ush  Index  onto  data  stack  */ 

MOV  di  (Dl)  ,-  (VS) 

/*  Move  base  address  of  data  segment  into  VI  */ 
MOV  Dl  ,B  1 

/*  Subtract  Index  from  base  address  and 
pop  data  stack  */ 

SUB  (PS)+,P1 

/*  Put  array  element  onto  the  datastack  */ 

MOV  df  (PI)  ,-  (PS) 

Record  Types 

A  record  type  is  a  structure  consisting  of  a  number  of 
components,  possibly  of  different  types.  The  record 
definition  specifies  for  each  component  of.  the  record, 
called  a  field,  the  type  of  the  field,  and  an  identifier 
which  denotes  it. 

A  record  type  may  have  a  variable  format  where  one  of 
the  fields  of  the  record,  called  the  tacr  field,  indicates 
the  chosen  format  of  the  record  at  any  time.  For  example: 

type  Monthlv_att.  endance  = 
record 

(1  to  12)  (Month)  , 

(0  to  2000)  (Year)  , 

case  (Presen t , Absent , Vacat ion)  tap  Attendance; 

Present:  (0  to  1000)  (Hours_vorked) ; 
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end 


Absent:  (0  to  31)  pay s_absent. )  ; 

Vacation:  (0  to  31)  (Vacation_days) 


end 
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subfields  of  a  record  from  the  start  of  the  record.  As  a 
result,  no  runtime  index  calculations  from  the  start  address 
is  necessary.  The  address  of  the  field  of  a  record  is  the 
base  address  of  the  data  segment,  plus  the  displacement  to 
the  start  of  the  record,  plus  the  diplacement.  of  the  field 
from  the  start  of  the  record.  The  fields  of  a  record  are 
placed  on  storage  boundaries,  and  no  packing  is  attempted. 
The  amount  of  storage  allocated  to  a  variable  of  the  record 
type  is  the  sum  of  the  fixed  part  of  the  record  and  the 
laraest  variant. 


Group  Types 


Group  types  are  similar  to  record  types,  except  that 
the  default  alignment  of  the  fields  is  by  consecutive  bits 
in  a  group,  ignoring  "natural"  storage  units  (packed 
records) .  Group  types  are  not  implemented  in  the  SUE- 11 
compiler.  The  PDP-11/20  provides  no  multiple  shift 
instruction,  all  shifts  must  be  done  in  steps  of  one.  Too 
much  code  would  have  been  required  to  break  out  the  fields 
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of  a  group  for  manipulation,  to  roak«*  use  of  the  group  type 
pract ica 1 . 

Manipulation  of  Structured  Data  Types 

JiS  with  all  variables,  the  manipulation  of  variables 
whos'e  length  is  greater  than  16  bits  (fullword)  is  done  on 
the  top  of  the  data  stack.  Put,  instead  of  pushing  the 
whole  variable  of  the  type  onto  the  runstack,  which  involves 
several  operations,  only  the  address  of  the  variable  is 
placed  on  the  data  stack  top.  The  only  manipulation  allowed 
on  non  numeric  quantities  is  comparison  for  equality  and 
assignment.  Having  the  addresses  of  the  two  structured 
variables  which  are  to  be  either  assigned,  or  compared  is 
enough  for  us  to  be  able  to  generate  a  loop  which  will 
handle  the  operation  byte  by  byte. 


4 . 5  Puntime  Code  Organization 

The  following  section  will  describe  the  code  sequences 
for  procedure  linkage,  and  for  the  implementation  of  control 
s  true tur es . 

4.5.1  Introduction 


The  code  segment  for  a 
physically  disjoint  from  its 


procedure  is  both  locrically  and 
data  segment.  In  fact,  for 
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testing,  we  loaded  code  segments  into  low  core  addresses  of 
the  PTP-11  expanding  as  necessary  towards  high  numbered 
core,  while  we  initialized  the  data  stack  for  programs  to 
the  highest  core  address  available,  and  allowed  the  data 
stack  to  grow  dynamically  towards  low  numbered  core. 

The  code  area  layout  of  a  procedure  is  shown  in  Figure 
4. 5. 1-1.  Each  procedure  contains,  in  its  code  segment,  the 
entry  addresses  of  all  contained  procedures.  ^he  entry 
addresses  are  at  a  known  displacement,  at  compile  time,  from 
the  start  of  the  code  segment.  The  Svstem/360  linkage 
editor  will  insert,  for  each  contained  procedure,  its  offset 
from  the  beginning  of  the  combined  code  segments  of  a 
program.  The  SUE-11  loader  will,  at  load  time,  traverse  the 
procedure  tree,  and  change  all  displacements  into  absolute 
machine  addresses. 

ror  use  in  the  following  sections,  we  define  the 

following  displacements  into  the  data  segment  and  the  code 

segment  of  a  procedure.  See  Figure  4.5. 1-1  and  figure 
4. 5.2. 3-1  for  further  clarification. 

dl  -  The  displacement  from  the  base  of  a  data  segment 

containing  the  static  pointer,  dynamic  pointer,  entry 
address,  and  parameters  of  the  procedure. 

d 2  -  The  displacement  from  the  base  of  a  data  segment 

containing  the  static  pointer,  the  dynamic  pointer, 
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Pntry  Point 
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|  Preliminary  Code  | 


Pntry  Address  PI 


Pntry  Address  P2 


Pntry  Address  Pn 


Code  for  Procedure 


4.5.  1-1  A  procedure  Code  Segment 


. 


entry  address,  parameters,  return  address  and  local 
data  of  the  procedure. 

d3  -  The  entry  address  of  a  procedure  is  located  at  a 
displacement  of  d3  from  the  start  of  the  code  segment 
of  its  containing  procedure. 

4.5.2  Procedure  Linkage 

4.5. 2.1  Introduction 


Fe  first  make  the  following  definitions. 

An  U£level  call  is  a  call  from  a  procedure  at  level  m  to  a 
procedure  at  level  n  where  m>n.  Tor  example,  in  figure  4.1- 
1,  procedure  G  calls  n. 

A  crosslevel  call  is  a  call  from  level  m  to  level  m.  ?or 

—  *5"*™  — “T" 

example,  in  ^igure  4.1-1,  P  calls  P. 

A  d ownle vel  call  a  call  from  level  m  to  level  n,  where 

m<n.  For  example,  in  Figure  4.1-1,  D  calls  E. 

An  entry  sequence  (prologue)  and  a  return  sequence 
(epilogue)  are  located  at  the  beginning  and  end, 
respectively,  of  the  code  segment  for  a  procedure.  Thus, 
only  one  copy  of  both  the  prologue  and  epilogue  are  needed 
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for  every  procedure.  In  contrast,  many  copies  of  the  code 
sequence  for  linking  to  a  particular  procedure  are  needed: 
one  for  each  time  the  procedure  is  called.  The  primary 
reason  for  including  the  linkage  code  in  the  calling 
sequence,  is  that  a  procedure  never  knows  the  lexic  level 
from  which  it  was  called.  Since  we  expect  procedures  to  be 
called  frequently,  we  wish  to  emit  as  efficient  a  calling 
sequence  as  possible. 


4 . 5. 2 . 2  Calling  Sequence 

This  section  outlines  the  order  in  which  the  code  for 
procedure  linkage  is  emitted. 

1)  Peserve  Space  for  Return  Value 

If  the  procedure  being  called  returns  a  value,  space 
must  be  reserved  on  the  data  stack  for  the  return  value. 
As  mentioned  in  section  4.4,  all  numeric  types  are 
manipulated  as  fullword  quantities  on  the  data  stack.  Since 
the  return  value  of  a  procedure  may  partake  in  arithmetic 
expressions,  all  numeric  return  values  are  treated  as  16  bit 
quantities . 

As  mentioned  in  section  4.4,  structured  data  types  are 
manipulated  by  forcing  their  addresses  onto  the  data  stack, 
rather  than  their  values.  Consequently,  if  a  structured 
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2 )  Set  Op  Dynamic  Link 


The  dynamic  link  is  extended  by 
pointer  of  the  called  procedure.  The  dy 
together  all  data  segments,  in  th 
activation.  Accordingly,  the  dynamic  poi 
the  address  of  the  calling  procedure’s 
base  address  of  the  calling  procedure’s 
contained  in  PI.  Consequently,  in  order 
pointer,  the  following  instruction  is  emi 


setting  the 
namic  link 
e  order  of 
nter  should 
data  seamen 
data  segm 
to  set  the 
tted . 


d  yna 
cha 
th 
cont 
t. 
ent 
dyna 


mic 
i  ns 
ei  r 
ain 
The 
is 
mic 


MOV  D1,-(?S) 


3)  Set_U£_Static_Link 

The  static  link  chains  together  those  data  segments 
accessable  from  the  currently  executing  procedure.  The 
static  pointer  must,  therefore,  be  set  to  the  address  of  the 
father  Drocedure’s  data  segment.  The  code  that  is  emitted 
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to  set  the  static  pointer  depends  on  whether  the  call  is 
uplevelf  downlevel,  or  crosslevel. 

If  the  call  is  a  downlevel  call,  then  the  father 
procedure  is  also  the  calling  procedure.  Accordingly,  to 
set  the  static  pointer,  the  following  code  is  emitted. 

MOV  D1,-(PS) 

If  the  call  is  a  crosslevel  call,  the  father  procedure 
of  the  calling  procedure  is  also  the  father  procedure  of  the 
called  procedure.  Jn  this  case,  the  following  code  is 
emitted. 

MOV  (T)1),-(PS) 

In  uplevel  calls,  the  father  procedure  may  be  at  any 
level  above  the  callino  procedure,  although  the  amount  is 
known  at  compile  time.  Therefore,  code  is  emitted  to  follow 
the  static  link  until  the  data  segment  of  the  father 
procedure  is  found,  and  then  to  set  the  static  pointer  of 
the  called  procedure  to  the  address  of  the  father's  data 
segment.  The  following  code  is  emitted. 

MOV  (PI) ,r2  /*  Follow  link  one  level  */ 

MOV  (F2) ,P2  /*  Keep  following  static  link  */ 
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MOV  (R2),-(PS)  /*  Found  it,  set  static  pointer  */ 


4)  Leave  Space  For  Entry  Address 

The  entry  address  of  the  called  procedure  will  be  saved 
on  entrance  to  the  procedure,  by  the  called  procedure's 
prologue.  Therefore,  the  following  code  to  leave  space  for 
the  entry  address,  is  emitted, 

CLP  -  (RS)  /*  Push  data  stack  */ 

5)  Load  Parameters 

The  parameters  are  now  evaluated.  The  evaluation  of 
the  parameters  will  force  the  parameter  values  onto  the  top 
of  the  data  stack,  at  the  expected  address  of  the  parameter 
within  the  called  procedure.  All  numeric  quantities  are 
passed  as  fullwords  to  facilitate  a  quicker  calling 
sequence. 

6}  Pass  Control  to  the  Called  Procedure 

9  —  ■  ■  —  —  —  —  ■■■'  —  ■■  —  -r  yr  —  —  — -  ■■  - 

The  PDP-11  provides  the  Jump  to  Subroutine  instruction 
( JSR)  to  facilitate  procedure  linkage.  The  format  of  the 
instruction  is  JSP  reg,dst  where  "reg"  may  be  any  one  of  the 
general  purpose  registers,  and  "dst"  is  an  address.  The 
execution  of  the  JSP  instruction  causes  the  following  to 
take  place: 
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1)  The  old  contents  of  "reg"  is  pushed  onto  the  processor 
stack  (PS)  . 

2)  The  value  of  the  instruction  counter  (PC)  is  placed 
into  "reg”. 

3)  The  destination  address  is  placed  into  the  instruction 
counter. 

To  pass  control  to  the  called  procedure,  its  entry 
address  must  be  known.  As  mentioned  in  section  4 .  s .  }  ,  the 
entry  address  of  every  procedure  is  located  at  a  known 
displacement  from  the  start  of  the  code  segment  of  its 
^ather  procedure  (in  the  procedure  tree) .  To  be  able  to 
call  a  non-paramo  trie  procedure  (a  procedure  which  is  not  a 
naraieter) ,  the  father  procedure  must  be  active. 
Consequently,  the  father  procedure’s  data  segment  must  be 
addressable,  either  through  DO,  Dl,  or  by  following  the 
static  links.  We  observe  in  figure  4.2. 1-1,  that  the  header 
of  each  data  segment  contains  the  entry  address,  as  well  as 
the  return  address  of  the  associated  procedure.  Having 
found  the  father  procedure's  data  segment,  we  can  retrieve 
its  entry  address  (the  start  of  the  code  segment)  and  can 
thus  find  the  entry  addresses  of  all  its  sons.  This  is  the 
principal  mechanism  whereby  the  transfer  of  control  between 
procedures  is  accomplished. 


Having  found  the  the  father  procedure's  data  segment, 
the  base  address  of  the  data  segment  will  reside  in  some 
register  Pn .  The  entry  address  of  the  f at her  procedure  is 
loaded  into  P.2  by  the  following  instruction: 


MOV  - 2  (Bn)  ,  P  2 


The  entry  address  of  the  procedure  to  be  called 
at  a  displacement  of  d3  from  the  beginning  of  its 
procedure's  code  segment.  Therefore,  to  transfer 
procedure,  the  following  code  is  emitted: 


resides 
father 
to  the 


JSP  7,^d3(P2) 


The  execution  of  the  above  instruction 
return  address  of  the  procedure  onto  the  data 
the  parameters,  but  below  the  local  data,  and 
to  the  called  procedure. 


will  push  the 
stack,  above 
then  transfer 


7)  Pest ore  the  World  on  Pet urn 


In  order  to  produce  more  efficient  code,  the  code  to 
restore  the  data  stack  after  return  from  a  procedure  is 
emitted  in  the  calling  procedure.  In  the  following 
discussion  we  refer  to  figure  4. 5. 2. 2-1.  First,  the  local 
storage  of  the  procedure  from  which  the  return  has  been 
made,  is  popped.  This  is  accomplished  by  the  following 
instruction: 
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MOV  T)  1,RS 


The  static  link,  pointed  at  by  RS  at  this  point,  is 
then  popped  from  the  stack  by  the  following  instruction: 

CLR  (RS)  + 

D1  is  then  reset  to  the  base  of  the  calling  procedures 
data  segment,  and  the  old  dynamic  pointer  is  popped  from  the 
data  stack.  The  following  instruction  is  emitted  to 
accomplish  this: 

MOV  (R S)  ,  D1 

At  this  point,  the  return  value,  if  any,  is  the  top 
element  of  the  data  stack. 


4 . 5.  2 . 3  Entry  Sequence 

We  illustrate  the  state  of  the  data  segment  of  a 
procedure  on  entry  in  Figure  4. 5. 2. 3-1.  The  following  code 
sequence  is  emitted  on  entry  to  the  procedure.  "he  code 
accomplishes  several  tasks.  First,  addressability  to  the 
local  data  of  the  procedure  is  provided  by  setting  ^1  to  the 
base  of  the  data  segment  of  the  procedure.  The  following 
code  is  thus  emitted: 
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figure  U.E.2.3-1  State  of  Data  Stack  on  Entry 


MOV  FS,D1 


/*  At  this  point,  PI  points  above  the  para¬ 
meters.  Force  it  to  point  below  the 
parameters  */ 

ADD  #d 1 , Dl 

Secondly,  in  order  to  be  able  to  call  procedures  local 
to  the  procedure  to  which  entry  has  just  been  made  (section 
4. 5. 2. 2),  the  entry  address  of  the  procedure  is  stored  by 
emitting  the  following  instruction: 

MOV  PC  ,-2  (Dl) 

As  nrentioned  in  section  4.5.1,  the  entry  addresses  of 
all  contained  procedures  are  located  at  the  beginning  of  the 
code  segment  of  a  procedure.  The  code  to  branch  around  the 
entry  addresses  is  thus  emitted. 

BP  #  (entry  addresses) 

At  this  point,  storage  has  been  reserved  for 
parameters.  The  following  code  is  emitted  to  reserve  space 
on  the  data  stack  for  local  data,  excluding  parameters. 

SUB  #  (d2-d  1)  ,RS 
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4 .  f .  2 . 4  Peturn  Sequence 


If  necessary,  the  return  value  is  computed  and  stored 
in  its  reserved  space  just  belov  the  dynamic  pointer,  and 
then  the  return  is  emitted.  By  the  use  of  the  SUE  System 
Language  construct  return  from  Procedure_name,  procedure 
return  may  occur  through  several  levels  of  procedure  calls. 
Accordingly,  addressability  must  be  provided  to  the  data 
segment  of  the  procedure  from  which  the  return  is  being 
made.  The  return  address  and  the  return  value  is  located  in  , 
the  data  segment.  We  assume  that  addressability  is  through 
some  register  Bn. 

If  return  is  through  more  than  one  level,  then  we  reset 
D 1  to  point  to  the  base  of  the  data  segment  of  the  procedure 
from  which  return  is  being  made.  This  is  accomplished  by  the 
following-  instruction: 

MOV  Bn , D1 

The  return  address  is  at  a  displacement  of  dl  from  the 
base  of  the  data  segment  now  pointed  at  by  Dl.  The  following 
instruction  to  branch  to  the  return  address  is  emitted: 

JMP  a)- dl  (Dl) 
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4.5.3  Control  Structure  Implementation 


This  section  briefly  describes  the  implementation  of 
the  SUE  control  structures. 

The  c;ycle  construct  is  implemented  by  a  backward  branch  to 
the  beginning  of  the  loop  from  the  end  of  the  statement  list 
comprising  the  loop. 


The  do  construct  is  implemented  by  forcing  the  upper 
bound  (or  lower  bound  as  the  case  may  be)  of  the  iteration 
onto  the  top  of  the  data  stack.  Each  time  through  the 
iteration,  the  control  variable  is  stepped  and  compared 
against  the  final  value  on  the  top  of  the  data  stack.  When 
the  iteration  is  complete,  the  loop  is  exited. 


The  case  construct  is  implemented  by  emitting 
into  the  code  stream.  The  branch  table  contai 
for  each  of  the  possible  values  of  the 
unspecified  cases  will  cause  a  branch  aro 
compound . 


a  branch  table 
ns  one  entry 
tag  type.  All 
und  the  case 


-73- 


5  Conclusions 


A  SUF  System  Language  compiler  for  the  PDP-11  has  been 
developed,  and  at  the  present  time  the  object  code  produced 
is  being  verified. 

The  PDP-11  object  module  produced. by  the  compiler  is 
currently  transferred  to  a  simulator  for  the  PDP-11  [Moffat 
1972].  The  simulator,  which  is  written  in  ALGOL W  [Wirth 
1966],  executes  on  the  System/360/370  family  of  computers. 
The  use  of  a  simulator  has  been  very  beneficial  in  testing, 
and  has  saved  a  great  deal  of  time  for  the  following 
reasons.  The  simulator  provides  diagnostic  messages 
whenever  errors  occur,  and  provides  a  dump  of  specified 
areas  of  core  at  the  termination  of  a  program.  If  the  SUF- 
11  object  programs  were  to  execute  on  a  PDP-11  during 
testing,  extra  time  would  be  needed  to  transfer  PDP-11 
object  modules  to  the  machine.  By  the  use  of  a  simulator, 
the  time-lag  between  the  production  of  a  PP^-II  object 
program,  and  the  execution  of  the  program  on  the  PDP-11 
machine  has  been  eliminated,  allowing  more  efficient 
testing. 
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5 . 1  SUE-11  Code  Efficiency 


In  this  section,  we  will  discuss  our  thoughts  on  the 
efficiency  of  the  code  produced,  and  some  ideas  on  how  it 
might  have  been  improved. 

The  efficiency  of  the  code  emitted  by  the  SUE- 11 
compiler  has  been  influenced  most  by  the  following  two 
considerations.  ^irst,  the  code  that  we  wished  to  produce 
had  to  be  relocatable.  We  felt  that  this  was  an  essential 
requirement  if  a  flexible  program  overlaying  scheme  was  to 
be  designed  at  some  time  in  the  future.  Secondly,  and  more 
important,  data  was  not  to  reside  at  absolute  machine 
locations  (except  if  the  programmer  explicitly  specifies) , 
so  as  to  allow  more  efficient  use  of  data  storage  space,  and 
to  allow  recursive  calls  on  procedures.  These  two  design 
criteria  have  allowed  the  SUE-11  compiler  to  be  compatable 
to  a  very  large  extent  with  the  STJE-360  compiler. 
Unfortunately,  it  has  also  resulted  in  some  code 
i nef f iciency . 


5.1.1  Disadvantages  of  the  Runtime  Storage  Organization 

The  runtime  storage  organization  used  by  the  SUE-11 
compiler  causes  some  inefficiency  in  two  areas:  array 
references  and  procedure  linkage. 
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As  mentioned  in  section  4.4,  the  PDP-11  machine 
architecture  does  not  allow  indexing  through  two  registers 
in  a  single  instruction.  In  our  addressing  scheme,  two 
levels  of  indexing  are  required  to  access  an  array  element: 
one  index  to  the  base  of  the  data  segment  containing  the 
array,  and  a  second  index  to  the  selected  array  element. 
Consequently,  no  single  PDP-11  instruction  may  be  used  to 
access  an  array  element  in  our  storage  scheme.  In  fact,  4 
PDP-11  instructions  occupying  6  PDP-11  words  are  needed  (see 
section  4.4). 

Pn  alternative  method  of  runtime  storage  organization 
which  assigns  absolute  storaqe  addresses  to  all  variables, 
could  have  been  used.  The  PDP-11  instruction  set  provides 
more  flexibility  in  addressing  if  data  reside  at  absolute 
addresses.  If  all  variables  were  to  reside  at  absolute 
addresses,  the  same  number  of  instructions  occupying  the 
same  number  of  PDP-11  words  would  be  required  to  access 
simple  numeric  types.  However,  when  accessing  an  array 
type,  a  saving  of  50^  in  both  the  number  of  instructions  and 
the  amount  of  space  occupied  could  be  achieved. 

Peferring  to  the  example  in  section  4.4,  if  the  array 
Fullword  were  to  reside  at  absolute  address  af,  and  the 
variable  Index  were  to  reside  at  absolute  address  ai,  the 
following  instructions  could  be  emitted  to  access  the  array 
element  Fullword (Index) : 
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MOV  6)fai,P1  /*  load  Index  into  PI  */ 

MOV  af(Fl),P1  /*  Load  array  member  into  PI  */ 


The  assignment  of  absolute  addresses  to  all  variables 
has  two  obvious  disadvantages-  First,  absolute  addressing 
disallows  recursive  calls-  Secondly,  unless  the  calling 
sequence  was  known  in  advance,  more  space  would  be 
frequently  required  for  data  segments  during  execution,  if 
absolute  addresses  were  used  rather  than  dynamic  allocation. 
Dynamic  allocation  implies  that  at  any  point  during 
execution,  storage  is  required  only  for  active  data 
segments.  Conversely,  absolute  addressing  implies  that  at 
all  times  during  execution,  storage  is  required  for  all  data 
segments  of  a  program.  Accordingly,  dynamic  allocation 
involves  the  use  of  more  instructions  for  array  accesses, 
and  requires,  on  the  average,  less  space  for  data.  Absolute 
addressing  on  the  other  hand  involves  the  use  of  more 
storage  for  data,  but  provides  a  50%  saving  in  the  code  for 
array  accesses.  Programs  that.  involve  frequent  array 
accesses  would  probably  benefit  from  the  use  of  absolute 
addresses . 

A  compromise  between  the  two  methods  of  storage 
allocation  can  be  provided  within  the  framework  of  the  SUF 
language.  We  can  allow  the  programmer  to  allocate  arrays  at 
absolute  machine  addresses  by  using  the  absolute  attribute 
of  the  language.  It  would  then  be  the  responsibility  of  the 
user  to  assign  an  address  to  the  variable,  and  to  make  sure 
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that  no  conflict  with  other  data  or  program  will  arise.  We 
then  hope  to  optimize  references  to  arrays  residing  at 
absolute  addresses. 

The  runtime  storage  organization  has  also  resulted  in  a 
slightly  more  expensive  linkage  sequence.  If  variables  were 
to  reside  at  absolute  addresses,  a  static  and  a  dynamic  link 
would  not  be  required,  and  the  code  for  updating  these  links 
would  not  have  to  be  emitted. 

As  an  alternative  to  assigning  absolute  addresses  to 
variables  and  to  still  permit  a  cheaper  linking  sequence,  a 
non  recursive  stack  could  be  used.  One  pointer  to  the  base 
of  the  stack  would  be  adequate  to  provide  addressability  to 
all  variables.  Once  again,  no  dynamic  and  static  links 
would  be  required  resulting  in  a  simpler  linking  sequence. 
An  added  advantage  to  this  method  is  the  fact  that  storage 
for  data  would  be  still  allocated  dynamically,  although 
recursive  calls  are  d5.sall.owed. 

This  scheme  also  has  two  disadvantages.  First,  array 
references  are  still  as  expensive  as  with  a  recursive  stack. 
Secondly,  the  calling  tree  for  the  program  must  be  available 
before  address  calculations  can  be  made. 

Since  we  do  not.  expect  procedure  calls  to  be  as 
frequent  as  array  references,  the  added  advantage  of 
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permitting  recursive  calls,  and/or  not  requiring  a  calling 
tree  at  compile  time,  outweigh  the  disadvantage  of  extra 
code  in  the  calling  sequence. 


5.  1.2  Disadvantages  of  Code  Organization 

The  decision  to  emit  relocatable  code  has  resulted  in 
surprisingly  few  inefficiencies.  The  two  areas  where  some 
saving  in  the  amount  of  code  might  have  been  achieved  are  in 
case  statements,  and  procedure  calls. 

In  the  implementation  of  the  case  statement  (section 
4.5.3),  a  branch  table  is  used  to  branch  to  the  selected 
alternative.  Since  the  code  is  relocatable,  the  branch 
table  consists  of  offsets  from  the  beginning  of  the  case 
rather  than  absolute  addresses.  Consequently,  each  time  a 
case  alternative  is  selected,  the  entry  in  the  branch  table 
corresponding  to  the  alternative  must  be  relocated.  This  is 
accomplished  by  one  instruction. 

Jn  the  implementation  of  procedure  calls  (section 
4. 5. 2. 2),  the  entry  address  of  a  procedure  is  contained  in 
the  code  segment  of  the  father  procedure.  Accordingly,  one 
instruction  must  be  emitted  to  load  the  entry  address  of  the 
father  procedure,  and  another  instruction  to  branch  to  the 
procedure.  If  absolute  code  vrere  emitted,  tfre  branch  to  the 
procedure  would  be  sufficient. 


-79- 


The  disadvantages  of  using  absolute  code  are 
following.  The  separate  compilation  of  hierarchica 
related  procedures  would  not  be  practical.  Wore  importa 
if  overlaying  were  to  be  implemented,  it  would  have  to 
done  at  compile  time,  rather  than  dynamically,  as  can 
done  with  the  present  implementation. 


the 

iiy 

nt, 

be 

be 


The  PDP-11  has  hardware  features  that  can  be  used 
conveniently  only  by  emitting  specific  instructions.  These 
usually  occur  when  the  handling  of  input/output  is  being 
attempted.  Unfortunately,  within  the  framework  of  the  SUE 
language,  these  operations  can  not  be  handled  in  as  few 
instructions  as  could  be  done  by  a  program  in  assembler 
language.  This  deficiency  is,  however,  a  deficiency  of  all 
high  level  languages,  and  to  be  able  to  conveniently  write 
programs  to  handle  these  operations  shows  the  power  of  the 
SUE  language.  Of  course,  specific  instructions  could  also 
be  emitted  by  the  use  o^  the  inline  feature  of  the  language. 

In  general,  the  code  produced  by  the  SUF-11  compiler  is 
quite  good  considering  the  design  criteria.  Wore  extensive 
peephole  optimization  would  certainly  help  in  some  localized 
areas,  and  hopefully  this  can  be  added  in  the  near  future. 
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5 . 2  SUF  Implementations  for  other  Minicomputers 


We  have  tried  to  point  out  throughout  the  thesis  that 
the  SUE  System  Language  is  to  a  great  extent  machine 
independent.  The  fact  the  the  SUE- 11  compiler  was 
implemented  by  modifying  the  SUE-060  compiler  enforces  this 
claim.  The  compiler,  in  fact,  is  implemented  so  as  to  allow 
the  development  of  a  compiler  for  another  machine  quickly 
and  easily.  We  have  implemented  the  SUE-11  compiler  in 
under  six  months,  of  which  about  two  months  were  spent  in 
becoming  familiar  with  the  SUE-360  compiler.  We  feel  that 
with  our  present  knowledge,  we  could  modify  the  compiler  to 
produce  code  for  another  machine  in  about  four  months. 

The  problems  of  the  SUE  System  Language  as  an 
implementation  language  for  other  minicomputers  are  similar 
to  the  problems  of  all  high  level  languages  of  trying  to  fit 
language  constructs  onto  a  machine  which  is  not  really 
designed  to  implement  them.  The  SUE  language  can,  however, 
be  fit  onto  a  machine  easier  than  most  high  level  languages 
because  of  its  machine  independent  constructs.  The  SU^ 
language  is  powerful  enough  so  that  the  majority  of 
operations  that  are  typically  required  may  be  easily  and 
elegantly  stated  within  the  language.  The  constructs  of  the 
SUE  language  are  designed  so  that  correct  and  understandable 
programs  are  produced.  These  are  usually  more  important 
criteria  than  a  slightly  more  efficient  code  sequence. 


Machine  independent  programs  can  certainly  be  written 
using  the  SUE  language.  If  compilers  for  the  SUE  language 
were  available  for  other  minicomputers,  programs  could  be 
easily  transferred  from  one  minicomputer  to  another.  It  is 
probably  true  that  programs  written  in  SUE  would  occupy  more 
space  than  the  corresponding  assembler  language  program,  but 
the  saving  in  development  time  should  offset  the  cost  of 
extra  memory  for  many  applications. 

We  have  shown  that  it  is  feasible  to  produce  a  compiler 
for  the  SUE  language  which  will  emit  code  for  a 
minicomputer.  Programs  for  the  PDP-11  have  been  translated 
by  the  SUE- 11  compiler,  and  executed  on  the  PDP-11 
simulator.  These  programs  include  simple  I/O  routines,  and 
the  SUE- 11  loader  which  loads  programs  produced  by  the  sum'¬ 
ll  compiler  onto  the  PDP-11.  We  hope  that  future  software 
development  will  prove  the  value  of  the  Sue  System  Language 
as  an  implementation,  language  for  minicomputers. 
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Appendix  I  -  The  SUF  System  Language  Grammar 


<comp ilat ion> 


I 

I 


<data  name>  ;  <data  block>  _| 
<context  name>  ;  <context  block>  _| 
<program  name>  ;  <scope  block>  _(_ 


<data  name> 


<data  name  head>  <parameters>  <returns> 


<data  name  head>  : := 

<parameters>  : := 

\ 


data  Cidentif ier> 

{  <identifier  list>  ) 
<e  mpty> 


<  retu  rns> 


returns  (  identifier  list>  ) 
<empty> 


identifier  list>  ::  = 

I 


<ident if ier> 

identifier  list>  ,  <identifier> 


iden  tif  ier> 


<compound  identifier> 
index  type  identifier> 
<constant  identifier> 
<procedure  identifier> 
<exit  identifier> 
<variable  identified 
<undeclared  identifier> 


<data  block> 


<definition>  ;  <data  block> 


<def inition> 


declaration 


declaration 


<template> 


<macro  head> 


<type> 


<record> 


<group  head> 


declaration 


|  <emptv> 


<declarat ion> 


\  <template> 


declare  <declaration  item> 


|  <declarat ion>  ,  <declaration  item> 


item>  <declaration  type>  (  <identifier  list>  ) 


<macro  head>  macro 


|  type  <identifier>  =  <type> 

|  constant  <ident.ifier>  =  <tuple> 


macro  <identifier>  <parameters> 


<declaration  type> 


I 


<record>  <field  list>  end 


1  droup  head>  ;  <field  list>  end 


record 


group  <brief  type> 


type>  ::=  Chrief  type> 


I 


area  <number> 


|  procedure  <parameter  type> 


<return  type> 
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1 

interrupt  handling  procedure 

Cbrief  t ype>  :  :  = 

<compound  type  identifier> 

1 

<index  type> 

I 

pointer  to  <identifier> 

1 

poverset  of  <index  type> 

I 

array  (  <index  list>  )  of  <brief  type> 

1 

<attribute>  <brief  type> 

<index  type>  ::= 

<index  type  identified 

1 

bit  (  <number>  ) 

1 

character  (  <number>  ) 

I 

(  Cidentifier  list>  ) 

1 

(  <expression>  to  <expression>  ) 

1 

(  <expression>  to  *  ) 

<constant>  :  :  = 

<unsigned  constant> 

I 

<adding  operator>  <nurcber> 

<unsigned  constant>  : 

:=  <number> 

1 

<string> 

1 

Cconstant  identifier> 

<index  list>  :  :  = 

<index  type> 

1 

<index  list>  ,  <index  type> 

<parameter  type>  :  :  = 

accepts  (  <t ype  list>  ) 

1 

<empty > 
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<return  type>  ::= 

returns  (  <type  list>  ) 

1 

<empty> 

<type  list>  ::= 

<declaration  type> 

1 

<type  list>  ,  <declaration  type> 

<f ield  list>  : := 

<field  declaration> 

! 

<variant  part> 

1 

> 

<field  declaration^  ,  <field  list> 

<field  declaration>  : 

:=  <type>  (  <identifier  list>  ) 

1 

<e  mpt y> 

<variant  part>  ::= 

<variant  head>  ;  Cvariant  list>  end 

<variant  head>  ::= 

case  <index  type>  tag  <identifier> 

<variant  list>  : := 

<else>  <field  list> 

1 

<variant  labels>  <field  list> 

1 

<variant  labels>  <field  list> 

<variant  list> 

<else>  : :  = 

el se  : 

< variant  labels>  ::= 

<label> 

1 

<variant  labels>  <label> 
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<label> 


:  :=  then  : 

|  <constant>  : 

|  <constant>  to  <constant>  : 


<a ttr ibute> 


: :=  register 

|  fast 

|  aligned 

1  aligned  ^number>  ,  <number> 


<context  name> 


context  <identifier> 


<context  block> 


<context  declaration> 

<context  block>  ;  <context  declaration^ 


<context  declaration> 

I 


: : =  <te  mpla te> 
absolute  (  <number> 
<ident if ier> 


<declaration  type> 


<prcgram  name> 


program  <procedure  identifier> 


<scope  block> 


I 

I 


<executable  statement  list> 

<def inition>  ;  <scope  block> 
<open  statement>  ;  <scope  block> 


<open  statement  ::=  open  <storage  reference> 

tement> 
st>  ; 


<executable  statement  list>  <executable  sta 


<executable  statement  li 
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<executahle  statement> 


^executable  statement>  ::=  <escape>  <label  part>  <condition> 


! 

<with  part> 

return  <from  part>  <condition> 

1 

<with  part> 

<selector>  ;  <alte rna tives>  en 

1 

<compound> 

1 

<tuple  elements 

! 

assert  <tuple  element> 

! 

<  e  ip  pt  y  > 

<escape> 

•  • 

ex  it 

I 

repeat 

<label  part> 

1 

<  Cexit  identifier>  > 

1 

<empt  y> 

<condition> 

•  *  — 

•  • 

unless  <expression> 

1 

when  <expression> 

I 

<empty> 

<with  part> 

•  •  ~ 

with  <tuple> 

1 

<empty> 

<from  part> 

•  • 

from  <procedure  identifier> 

I 

<empty> 
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<selector>  : := 

if  <expression> 

1 

case  <index  type>  tag  <expression> 

<alternat ives>  ::= 

<else>  Executable  statement  list> 

1 

<alternative  labels> 

<executable  statement  list> 

1 

<alternative  labels> 

<executable  statement  list>  ; 

<alternatives> 

Alternative  labels> 

: : =  <label> 

1 

<alternative  labels>  <label> 

<compound>  : : = 

begin  <scope  block>  end 

1 

cycle  Executable  statement  list>  end 

1 

<do  head>  ;  <executable  statement  list> 

end 

1 

<exit  label>  <compound> 

<  <exit  identifier>  > 

<do  head>  : : - 

do  <variable  identifier>  := 

<iteration  control> 

<iteration  control>  : : 

:=  <tuple> 

1 

<expression>  to  <expression> 

1 

<expression>  downto  <expression> 

1 

each  Cindex  type> 
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<exit  label> 


<  <identif ier>  > 


<tuple>  :  :  = 

<tuple  element> 

1 

<tuple>  ,  <tuple  element> 

<tuple  element>  :  :  = 

<value> 

1 

<index  type  identified 

1 

<conpound  type  identifier> 

<value>  :  :  = 

» 

<expression> 

1 

<storage  reference>  :=  <value> 

<expression>  :  :  = 

<logical  terin> 

1 

<expression>  <or  operator> 

<logical  term> 

<logical  term>  :  :  = 

<logical  factor> 

I 

<logical  term>  <logical  factor> 

<logical  factor>  : := 

<string  expression> 

1 

<logical  factor>  <relational  operator> 

^string  expression> 

<string  expression>  : : 

=  <numeric  expression> 

1 

<string  expression>  | | 

<numeric  expression> 

Cnumeric  expression>  : 

:=  <numeric  term> 
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I  Cnumeric  expression>  <adding  operator> 

Cnumeric  term> 

<nuieric  term>  : :=  Cnumeric  factor> 

|  -i  Cnumeric  factor> 

|  <adding  operator>  Cnumeric  factor> 

|  Cnumeric  ter ro>  <multipl ying  operator> 

Cnumeric  factor> 

Cnumeric  factor>  ::=  <storage  reference) 

|  <unsigned  corvstant> 

|  (  Ctuple>  ) 

<storage  reference)  ::=  Cvariable  identifier> 

|  <procedure  identifier!) 


I 

Cstorage  reference) 

a) 

1 

<storage  reference) 

.  <identifier> 

I 

<storage  reference) 

(  <tuple>  ) 

1 

typed  Cbrief  type>  ( 

Ctuple>  ) 

1 

Cescape  type>  ;  Ccom 

poun  d> 

<escape  type> 

•  •  — 

•  • 

exits  with  (  <type  1 

ist>  ) 

Cor  operator> 

•  •  — 

•  • 

1 

\ 

xor 

Relational  operator> 

-  •  =  < 

\ 

— 
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Adding  operator> 


! 


<multiplying  operator> 


Appendix  II 


Cample  SUE-11  Programs 


1.  I/O  Routines  and  Simple  PL/I  Paragrapher 

1  _ CO  4 TEXT  MUff  AT  ;  _  _  _  _ 

2  type  ui_o.srATus  =  array  ((3  ru  m  of  bitiks); 

3  TYPE  if PRINT.LINE  =  ARRAY  ((J  TO  132))  OF  HIT  (3); 

4  TYPE  $iCARO_lMAGE  =  APR  AY  ((3  TO  79))  OF  HIM  3 )  ; 


5  TYPE  R  ;  3  1  S  T  2  R  _  K I  TS  =  (ERR_)IT,  BITi't,  8IT13,  UT12,  3USY_8lT,  A  I T  L  0  ,  BIT?,  ftl  T  6 , 

6  DJ\lfc_BIT,  1NT_3IT,  BIT1),  3iT4,  UT?,  HI  T  2 ,  3  I  T 1  ,  G0_3IT); 

7  TYPE  ST  MDS_R2G  I  STEP  =  M'l-liCT  )F  REG  I  5  T  2R_B  ITS  ; 

3  A3  S-jLU  I  F  (63334)  S  TA  T  JS_  R  r  » I  j  Tr  R  PKS; 

"  9'  A  IS  .L  Jl  L"  (65386)  Bin  3)  ?  :R3  ; 

10  A  .1 S  ’!  L '  J  T  h  (o5338>  ST  A  TU5_RE3  I  STEP  PPS; 

11  A  3  3  J  L  U  T  L  (  4  5  3  9  J  )  317(3)  PP.B; 

12  ABSOLUTE  ( 04334)  BIT (16)  Pi; 

13  C INSTAmT  LIME_r£EO  =  "5o";  /  •»  i  SIGN  */ 

14  CONSTANT  YEr'  T  _T  A  B  =  "r>  =  ";  /*  >  */ 

1*>  C  0  J  6  T  A  \  T  F  it-  M_FE  r.  ,)’=  '  *  7 1  ’  ‘  ’  ;  /*  •/  -  / 

16  CONSTANT  C ARRIAGE_RE TU^N  =  "VC";  /*  <  */ 

17  MACRO  J  ET  _CQN  TROL  (  M*  ATUS _R E 3 ,  S  A  V  E_  S  TAT JS )  ; 

15  SaVE.STATUS  (0)  :=  PS; 

19  _ _ CYCLE  _  _ _ _ _ _ 

20  '  PS  :=  0; 

21  CYCLE 

22  ....  EXIT  UNLESS  3JSY_5IT  <=  STAT'J5_RE3; 

23  ENO; 


24  PS  :=  128; 

25  ....  EXIT  UNLESS  BUSY.3IT  <=  STATUS_«E3; 

26  cNrJ; 

27  SAVE.STATUS  (1)  :^=  TY-^EO  I3IT(16)  ( 3TATUS_RE3 ) ; 

28  _ TYPE,)  81  T  (  16  )  (  S  T  AT'J IS  _  3  G 3  ] I  :  =  0  _  _ _ 

29  '  EMD  MACRO; 


30  MACRO  \  EL  I  NO  J  I  SH_C0N  TR  JL  (  S  T  \TUS_REG»  SAV£_ST  AT'JS  )  5 

31  TYPED  BIT  (16)  (  STATU S_8E5 )  :=  SAVE_STATUS  (1); 

32  Pj  :=  S A ‘/ E _ S T A T •  J S  (3) 

33  END  MACRO;  _ 


34  MACRO  SET  (6UFFADDR); 


35 

36 

37 
33 

39 

40 

41 

42 

43 


B  c  S  I  N 

DECLARE 

_ _  3IT( 16  )  (  I  )  , 

d  l  T  (  8  )  (CHAO, 

OLD_S  TA  TUS  (SAVE_3RS), 
POINTER  TIJ  »iCARD_ IMAGE 
GET_CONTR!JL(PRS,  SAVE.PRS); 
CARD  :=  3UFFAD0R; 

I  ;  =  j  ; 


( CARD)  ; 
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44 
4  4 
4o 


47 
4  3 
49 


5  J 

51 

52 

53 

54 


55 

56 

57 

5  3 

59 

60 
61 

62 

6  3 

64 

65 
64 
67 

63 

69 

70 

71 

“72 

73 

74 

75 

76 

7  7 
73 

79 


<i:OLLECTION_LCMP> 

CYCLE 

PRS  :=  PRS  I  00_ri I T ; 

CYCLE 

....  EXIT  WHEN  D0NE_KSIT  <=  PRS  5 
END: 


if  ERi_4ir  <=  prs; 

THEM  : 

c a  I  T  COLLECT  I0M_LDnP> 

END; 

(Thar"  prIsT 

IF  CHAR  =  LI ME_FEE  0; 

THEN  : 

IF  I>  0; 

T  HE  N ; 

TFTi.waTl-:Tr ^TrATTTcIre'tIJrn' 

then:  I  :=  I  -  1 
END 

END 


END; 

IF  CHAR  =  LIN5_FEEJ; 

THEN  : 

....  EXIT  COLLECT  IQN_LOOP> 

END; 


IF  CHAR  =  FJR1_FEEl); 

THEN  : 

....  EXIT  COLLECT  I  UN_LL'(jP> 

END; 

FTTH  AR~ ^“vFrTIT a' til 
THEN  : 

....  EXIT  COLLECT  ION_LO(1P> 

END; 

CARD 3 (  I )  : =  C  UR ; 
r  ;=  T+'l;" 

.  ...  EXIT  <  COLLECT  I  ON_L  9CJP>  WHEN  I>  79; 
ENO  CuLL  iCT  I  0  i__  OOP>  ; 


80 

81 

82 

83 

84 

85 

86 


CYCLE 

....  EXIT  WHEN  I>  79; 

CAR  53(1)  jV-oLANCCT 

I  :=  I  +  l; 

END; 


REL  INOU  I  SH_COf.TROL  (  PRS,  SAVE_PRS> 

EMO 
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87 

E.MD  MA^RH; 

88 

89 

90 

91 

MACRO  PJT(  OUFFADLJR)  ; 

BEGIN 

DECLARE 

a  I  t  ( 1 6 )  ( i .  j  ) , 

9^ 

93 

94 

95 

95 

97 

0  LD_S  T  AT  US  IS-WE.PPSI, 

POINTER  TO  fe5PRINT_L INE  (CARD); 
GET_CONTR.3L  (  EPS  »  S4VE_DPS  )  ; 

CARD  :=  3UF  FADOR; 

I  :=  ); 

J  :=  132; 

98 

CYCLE 

99 

....  EXIT  WHEN  C AR03( J )  R  L  A  J  K  ; 

00 

....  EXIT  WHEN  J  C  0 ; 

01 

J  :  =  J  —  1 ; 

02 

END; 

• 

0  3 

CYCLE 

04 

....  EXIT  WHEN  I>  J; 

05 

PP3  : =  CAR  DR (  I ) ; 

06 

CYCLE 

07 

....  EXIT  WHEN  DON  E_R I T  <=  PPS  ; 

08 

end; 

" 

09 

I  :=  I  +  li 

10 

END; 

a 

/*  PPQ  :=  CA'RRI  AGE.RET  JRN; 

Y 

L2 

*  CYCLE  EXIT  W  IEN  DONE.EIT  <=  PPS; 

* 

13 

♦  END; 

r 

1 4 

♦  PP3  :=  LINE.FEED; 

£ 

.5 

*  CYCLE  EXIT  WIEN  DONE.RIT  <=  PPS; 

a. 

.6 

*  end; 

-A* 

.7 

REL  INDUl  SH_CO'-JT  RJL  (  PPS.  S  AVE_PPS) 

3 

END 

9 

END  M A C  R 0 

0 

- 
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121  3 A  F A  MOFFAT; 


122  TYPE  KtY_TYPES  =  ( PL I_GROlP ,  PL  I _THFN ,  PLI.FLSF); 

123  TYPE  BYTE  =  QIT(8); 

124  TYPE  WJRD  =  BI T(  16) ; 


125  TYPE  CJT.ARRAY  =  ARRAY  1(0  TO  132))  OF  BYTE; 

12t>  CONSTANT  ALA  K  =  "40"; 

12  I  CONSTANT  COLON  =  "7A»; 

123  CONSTANT  S  EM  I  _  C  D  L  U  N  =  "5E"; 

123  CONSTANT  A  =  "CL"; 

13  3  "CONSTA'NT  L  =  "E 

131  CONST  AnT  L  =  "i)3"; 

132  CONSTANT  S  =  "E2"; 

133  CONSTANT  E  =  " C5" ; 

134  CONSTANT  D  =  "C4"; 

135  CONSTANT  0  =  "05"; 

‘  1 36  CO  N  ST  A  n’T  N  =  "  0  5  "  \ 

137  CONSTANT  T  =  "E3 " ; 

138  CONSTANT  H  =  "CB"; 

139  CONSTANT  INDENT  =  "00"; 

14  J  CONSTANT  EXDENT  =  "01"; 

141  CONSTANT  NE  W_L I N t  =  "0A»; 

142"  "  CONSTmN  T~BYTS_MASN  =  "  FF"  ; 

143  DECLARE 

144  ARRAY  (()  TO  2J))  OF  BYTE  (  SPE  C_PIJFF  )  , 

145  ARRAY  UJ  TO  3))  OF  BYTE  ( K.E  Y_h'JFF:  )  » 

1^6  WORD  (SPEC_PTR,  KEY_PTA,  GATHERING,  SKIPPING,  ST  K_°TR ,  IN.PTR), 

147  WORD  (  BL  ANK_S'<  I  P  !■*  I  NG  ,  OJT.PTR), 

148  “  ARRAY"  ((0  “TO  19))  OF  <FY_TYPES  (  TY°E_Sf  AC<  )  , 

149  PROCEDURE  RETURNS  (BYTE)  (PARAGRAPHS  0_CY  TE ) , 

150  WORD  (K,  TAB,  TEMP), 

151  POINTER  TO  LJUT_  \RRAY  (  0  J  T  3U  F_P  TR  )  , 

152  $$CARD_I  MAGE  (INBIJF), 

153  i*PRINT_L  INF  (3UTI3UF); 

15  4  _TT 
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155  DAT*  PARAGR*PHED_HYTH_  REJHRNS  ( _TE  1PA  )  ; 

155  '  DEC L Arte 

157  PROCEDURE  RETURNS  (BYTe)  (  GE  F_  SJIJRC  E_BY  T  E  )  * 

158  WORD  (TEMP); 

159  OECLAKc 

160  WORD  (CO,  Cl,  C2,  CES ) ; 

161  MACRO  P  I S  H  (  P  A  P  v  -1  )  ; 

162""  "  "  “T  i  ?C_  S  T  ACTCS  Vi  _*PTR  :  =  ST  <_PT  R  +“l)  ;=  PAKV-' 

lo8  END  MAC R 0 ; 

164  MACRO  POP; 

165  S T ’< _ P T K  STK_PTR  -  1 

165  END  MA^RD; 


167  4  ACR  0  jPEC_P'JSH  (  PAR  AM  )  ; 

168  S J  EC_dUFE  (SPEC_PTR  :=  $PHC_PTR  +  1)  :  =  “ARM 

169  END  MACRO! 

179  MACRO  SPEC  I AL_CH AK_C HECK ; 


171  IF  TEMP  =  COLON; 

172  THEN:  PUSH  (  PL  l  _(VROLP  )  ; 

17  8  SPEC_PUSH(  IriOt'lT)  ; 

174  BLANK_SKIPP ING  :  =  1; 

175  _ _ _ E_LS  E:  _ , _ _ _ 

17 6 ”  "  I F  T E  M P  =  S  E H  I  _C  jw  9N ; 

177  THEN:  SPEC_PU5H(  1EW_L INE ) ; 

17P  CASE  KEY_TYPES  TAG  T  YP  E_ST  ACM  STK_  p  T  R  )  ; 

179  PLI_THEN: 

180  SPEC_PJSH ( EXDENT)  ; 

181  pup; 

182  P L I _E L Sl  : 

188  SPEC_PJSH( EXDENT) ; 

184  POP; 

18o  CYCLE  _ 

Ido  '  “  TYPE_ST  AC  <  (  ST  <  _PTR  )  =  PLI  .GROUP; 

187  SPEC_PUSH ( EXOENT ) ; 

188  IF  TYPE.ST ACK( STK.PTR)  =  PL  I _ THEN; 

189  THEN:  pop; 

19  9  ....  EX  I  T _ 

191  '  ’  "  "  ‘  end; 

192  pjp; 

198  END 

194  END;  _ 


195  BLANK_SKIPPI NG  :=  1; 

195  ELSE: 

197  IF  TEMP  =  BLANK; 

198  THEN: 

199  IF  oLA.N<_SKIPPING; 
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200 

201 

202 

203 


THEN: 

K£Pt AT  <L0JP>; 

ELSE:  8LANK_SKIPPING  :=  1 
END ; 


204  ELSE: 

205  IF  TEM9>  t3L AMK  ; 

2  05'  '  T H c  M  r  OLA'I  kIS  KTP'PT  NO  :  =~ O' 


207 


END 


208  END 

2 09  END 


210 

E  ID 

211 

END  MACRO; 

212  MACRO  <  EY  WOR  J_CHECK ; 


213 

214 

215 

21-3 

217 

IF  KEY_PTR>  0; 

THEN:  Cj  :=  KEY_P'JFFl  0)  £  rfYTE_MASK; 

Cl  :=  <EY_  BUFF ( 1 )  £  BYT  E_HASK ; 

C2  :=  KE Y_b JFF ( 2 )  £  BYTE. MASK; 

C 3  :=  KEY_ BUFFI  3)  £  B YT E_MA  SK ; 

218 

IF  C3  =  E; 

219 

THEN  : 

22  0 

IF  Cl  =  N  £  C2  =  0  £  KEY_PTK  = 

2; 

221 

THEN:  SPLC.^USH ( TEMP) ; 

222 

TEMP  :=  EXDENT; 

223 

POP; 

22  4 

ELSE; 

225 

IF  Cl  =  L  £  C2  =  S  £  C3  = 

E  £ 

K E  Y _  PT R  =  3; 

22t> 

Then:  push i pl I _ELS E ) ; 

227 

S?EC_PUSH(  INDENT ) 

228 

END 

229 

END; 

233 

ELSE  : 

231 

IF  CO  =  0  £  Cl  =  3  £  KE Y_PTR  = 

l; 

232 

THEN:  SPEC_PUSH( TEMP ) ; 

233 

TEMP  :=  INDENT; 

234 

PUSH! PLI.GROUP) ; 

235 

ELSE  ; 

23o 

IF  CO  =  T  £  Cl  =  H  £  C2  = 

E  £ 

C3  =  N  £  <  EY_PTR  =  3; 

237 

THEN:  SPEC_PU->H  (  TH*'P)  ; 

233 

S  P  E  C  _P  J  S  H  (  NEWSLINE)  ; 

239 

TEMP  :=  rUOeNTT" 

243 

PJSHIPLI.THEN) ; 

241 

BlANK_SK IP  PING  :=  1 

242 

END 

243 

END 
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2 44  END 


245 

END 

24  D 

# 

END  MAoRO; 

247 

l 

24  3 

DATA  GE  T_SD JRCE_bY TF  RET JRNS  (  S  J  J RC E _C H  Ak  )  ; 

247 

i  

250 

PROGRAM  GET_S3LJRCE_3YTE; 

• 

251 

252 

253 

254 

255 

if  i n_?tp  =  eo; 

THEN:  GET(INliUF); 

IN_PTR  :=  0; 

ELSE:  I N_PTR  :  =  IN_PTR  ♦  1 

END; 

255 

257 

....  RETURN  WITH  TYPED  ARRAY  (  ( 0  T3  79))  OF  BIT  (3)  (INBJF)  (  I N_P TR ) ; 

_l_ 

- 

• 

- 

•  ♦. 
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253  PIJGK  A'-’  PARAGRAPHS  ')_iiYTE: 

259  '  <LHOP> 

26  0  BEGIN 

261  IF  SPEC_PTR>=  D; 

2r>  2  THEM  :  TEMP  :  =  SPFC_bJFF ( SPEC.PTR ) i 

263  SPEC_PTR  :  =  SPF.C.PTP.  -  1; 

2  6V  ELSE- f~  T  EM  P  ~T=  GfO  JTRCETJb  V  ft ; 

266  TEMP  :=  TEMP  C  BY  T  E_M ASK ; 

265  IT  GATHERING; 

267  THEN  : 

268  IF  TE  M P>=  A  £  TEMP  <=  L\ 

26  9  THEN:  ~ 

27  0  IF  KEY.PTR  =  3; 

271  THEM;  GATHERING  :=  0; 

272  SKIPPING  :=  l; 

273  ELSE:  KEY_6UFF(KEY_PTR  :=  KEY.PTR  +  1)  :=  TE-1P 

274  END: 


275  ELSE:  GATHERING  :=  0; 

27b  IF  TEMP  <  A; 

277  THEN:  KEYWjRD_CHECK ; 

276  GATHERING  :  =  0; 

279  '  '  SKIPPING  : =  0; 

280  ELSE:  SKIPPING  :=  1 

281  END 

282  END; 


283  ELSE: 

284  IF  SK IPP  IMG; 

285  THEN: 

285  IF  TEMP  <  A; 

287  THEN :  SKIPPING  :  =  0 

283  END; 


289  ELSE: 

29D  IF  TE  MP>=  A  L  TEMP  <=  Z; 

291  THEN:  GATHERING  :=  1; 

292  KtY.PTR  :=  0; 

293  KE Y_8 JFF (0 )  :  =  TEMP; 

294  END 

295  END 

295  END 

2  9  7  ~ "  ew  r~ 

293  SPEC  I AL_CHAR_CHECK; 

299  END  <LJ0P>; 

30 J  ....  RETJRN  WITH  TEMP; 
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30  L 


302 

P  R  3  GP.  A  B  J  F  r  A  T  ; 

303 

DO  I  N_P  T  R 

304 

P  R  S  : 

30  5 

END; 

30o 

S  P  t  C  _  P  T  'R  : 

307 

IN_£>Tft  :  = 

303 

GATHERING 

309 

STK.PTR  := 

310 

TYPE_ST  ACK 

311 

KEY.PTR  := 

312 

JUTBUF.PT, R 

313 

on  OUT.PTR 

314 

QjTBJ 

315 

END ; 

315 

‘OUT.PTR  :  = 

317 

TAB  :=  0; 

313 

03  K  :=  1 

319 

- 

TcMP 

320 

IF  TE 

321 

THE 

322 

ELS 

323 

324 

3  2  5 _ 

=  p 


L I  _GRUJ  p ; 


BLANK; 


N:  TAB  :=  TAh  ♦  5; 

E: 

IE  TEMP  =  EX0E.MT; 

THtN  :  TAB  :=  T  A  3  -  5 i 
ELSE:  JUTBUF_PTRO( JUT_PTR 


=  JUT  PTR  +  1)  :=  TE’-IP; 


32  0 

32  7 


IF  TEMP  =  ULK_Ll\5; 

THEN:  pjti jjtbuf ) ; 


323 

329 

330 


0  0  L'U T _ P T R  :=  1  TU  132; 

CJ  J  T3UF  _P  TR-ii  (  0UT_PTc  ) 

END; 


:=  BLANK; 


331 
33  2 


GUT_^T<  :=  TAB; 


END 


33  3 

334 

335 


END 


END; 


END 


i$E  OF 

~  3  3t>  '  zrz 
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